A C D E F G I L M N O P R S T U W X 
All Classes All Packages

A

addCapturedContent() - Method in class org.apache.solr.handler.extraction.SolrContentHandler
Add the per field captured content to the Solr Document.
addContent() - Method in class org.apache.solr.handler.extraction.SolrContentHandler
Add in the catch all content to the field.
addField(String, String, String[]) - Method in class org.apache.solr.handler.extraction.SolrContentHandler
 
addLiterals() - Method in class org.apache.solr.handler.extraction.SolrContentHandler
Add in the literals to the document using the SolrContentHandler.params and the LITERALS_PREFIX.
addMetadata() - Method in class org.apache.solr.handler.extraction.SolrContentHandler
Add in any metadata using SolrContentHandler.metadata as the source.
autoDetectParser - Variable in class org.apache.solr.handler.extraction.ExtractingDocumentLoader
 

C

CAPTURE_ATTRIBUTES - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
Capture attributes separately according to the name of the element, instead of just adding them to the string buffer
CAPTURE_ELEMENTS - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
Capture the specified fields (and everything included below it that isn't capture by some other capture field) separately from the default.
captureAttribs - Variable in class org.apache.solr.handler.extraction.SolrContentHandler
 
catchAllBuilder - Variable in class org.apache.solr.handler.extraction.SolrContentHandler
 
characters(char[], int, int) - Method in class org.apache.solr.handler.extraction.SolrContentHandler
 
config - Variable in class org.apache.solr.handler.extraction.ExtractingDocumentLoader
 
config - Variable in class org.apache.solr.handler.extraction.ExtractingRequestHandler
 
CONFIG_LOCATION - Static variable in class org.apache.solr.handler.extraction.ExtractingRequestHandler
 
contentFieldName - Static variable in class org.apache.solr.handler.extraction.SolrContentHandler
 
create() - Method in class org.apache.solr.handler.extraction.ParseContextConfig
 
createFactory() - Method in class org.apache.solr.handler.extraction.ExtractingRequestHandler
 
createSolrContentHandler(Metadata, SolrParams, IndexSchema) - Method in class org.apache.solr.handler.extraction.SolrContentHandlerFactory
 

D

DEFAULT_FIELD - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
Optional.
defaultField - Variable in class org.apache.solr.handler.extraction.SolrContentHandler
 
document - Variable in class org.apache.solr.handler.extraction.SolrContentHandler
 

E

endElement(String, String, String) - Method in class org.apache.solr.handler.extraction.SolrContentHandler
 
EXTRACT_FORMAT - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
Content output format if extractOnly is true.
EXTRACT_ONLY - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
Only extract and return the content, do not index it.
ExtractingDocumentLoader - Class in org.apache.solr.handler.extraction
The class responsible for loading extracted content into Solr.
ExtractingDocumentLoader(SolrQueryRequest, UpdateRequestProcessor, TikaConfig, ParseContextConfig, SolrContentHandlerFactory) - Constructor for class org.apache.solr.handler.extraction.ExtractingDocumentLoader
 
ExtractingDocumentLoader.MostlyPassthroughHtmlMapper - Class in org.apache.solr.handler.extraction
 
ExtractingMetadataConstants - Interface in org.apache.solr.handler.extraction
Constants used internally by the ExtractingRequestHandler.
ExtractingParams - Interface in org.apache.solr.handler.extraction
The various Solr Parameters names to use when extracting content.
ExtractingRequestHandler - Class in org.apache.solr.handler.extraction
Handler for rich documents like PDF or Word or any other file format that Tika handles that need the text to be extracted first from the document.
ExtractingRequestHandler() - Constructor for class org.apache.solr.handler.extraction.ExtractingRequestHandler
 

F

factory - Variable in class org.apache.solr.handler.extraction.ExtractingDocumentLoader
 
factory - Variable in class org.apache.solr.handler.extraction.ExtractingRequestHandler
 
fieldBuilders - Variable in class org.apache.solr.handler.extraction.SolrContentHandler
 
findMappedName(String) - Method in class org.apache.solr.handler.extraction.SolrContentHandler
Get the name mapping

G

getContentType(SolrQueryRequest, SolrQueryResponse) - Method in class org.apache.solr.handler.extraction.XLSXResponseWriter
 
getDescription() - Method in class org.apache.solr.handler.extraction.ExtractingRequestHandler
 
getExplicitPassword() - Method in class org.apache.solr.handler.extraction.RegexRulesPasswordProvider
Gets the explicit password, if set
getPassword(Metadata) - Method in class org.apache.solr.handler.extraction.RegexRulesPasswordProvider
 
getPasswordMap() - Method in class org.apache.solr.handler.extraction.RegexRulesPasswordProvider
 
getPermissionName(AuthorizationContext) - Method in class org.apache.solr.handler.extraction.ExtractingRequestHandler
 

I

ignorableWhitespace(char[], int, int) - Method in class org.apache.solr.handler.extraction.SolrContentHandler
Treat the same as any other characters
IGNORE_TIKA_EXCEPTION - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
if true, ignore TikaException (give up to extract text but index meta data)
inform(SolrCore) - Method in class org.apache.solr.handler.extraction.ExtractingRequestHandler
 
INSTANCE - Static variable in class org.apache.solr.handler.extraction.ExtractingDocumentLoader.MostlyPassthroughHtmlMapper
 
isDiscardElement(String) - Method in class org.apache.solr.handler.extraction.ExtractingDocumentLoader.MostlyPassthroughHtmlMapper
Keep all elements and their content.

L

LITERALS_OVERRIDE - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
Literal field values will by default override other values such as metadata and content.
LITERALS_PREFIX - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
Pass in literal values to be added to the document, as in
load(SolrQueryRequest, SolrQueryResponse, ContentStream, UpdateRequestProcessor) - Method in class org.apache.solr.handler.extraction.ExtractingDocumentLoader
 
lowerNames - Variable in class org.apache.solr.handler.extraction.SolrContentHandler
 
LOWERNAMES - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
Map all generated attribute names to field names with lowercase and underscores.

M

MAP_PREFIX - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
The param prefix for mapping Tika metadata to Solr fields.
mapSafeAttribute(String, String) - Method in class org.apache.solr.handler.extraction.ExtractingDocumentLoader.MostlyPassthroughHtmlMapper
Lowercases the attribute name
mapSafeElement(String) - Method in class org.apache.solr.handler.extraction.ExtractingDocumentLoader.MostlyPassthroughHtmlMapper
Lowercases the element name, but returns null for <BR>, which suppresses the start-element event for lt;BR> tags.
metadata - Variable in class org.apache.solr.handler.extraction.SolrContentHandler
 
MostlyPassthroughHtmlMapper() - Constructor for class org.apache.solr.handler.extraction.ExtractingDocumentLoader.MostlyPassthroughHtmlMapper
 

N

newDocument() - Method in class org.apache.solr.handler.extraction.SolrContentHandler
This is called by a consumer when it is ready to deal with a new SolrInputDocument.
newLoader(SolrQueryRequest, UpdateRequestProcessor) - Method in class org.apache.solr.handler.extraction.ExtractingRequestHandler
 

O

org.apache.solr.handler.extraction - package org.apache.solr.handler.extraction
ExtractingRequestHandler and related code.

P

params - Variable in class org.apache.solr.handler.extraction.SolrContentHandler
 
parse(InputStream) - Method in class org.apache.solr.handler.extraction.RegexRulesPasswordProvider
Initialize rules through file input stream.
PARSE_CONTEXT_CONFIG - Static variable in class org.apache.solr.handler.extraction.ExtractingRequestHandler
 
parseContextConfig - Variable in class org.apache.solr.handler.extraction.ExtractingDocumentLoader
 
parseContextConfig - Variable in class org.apache.solr.handler.extraction.ExtractingRequestHandler
 
ParseContextConfig - Class in org.apache.solr.handler.extraction
 
ParseContextConfig() - Constructor for class org.apache.solr.handler.extraction.ParseContextConfig
Creates an empty Config without any settings (used as placeholder).
ParseContextConfig(SolrResourceLoader, String) - Constructor for class org.apache.solr.handler.extraction.ParseContextConfig
Creates a ParseContextConfig from the given XML file, loaded from the given SolrResourceLoader.
ParseContextConfig(SolrResourceLoader, Element) - Constructor for class org.apache.solr.handler.extraction.ParseContextConfig
Creates a ParseContextConfig from the given XML DOM element.
parseRulesFile(InputStream) - Static method in class org.apache.solr.handler.extraction.RegexRulesPasswordProvider
Parses rule file from stream and returns a Map of all rules found
PASSWORD_MAP_FILE - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
Optional.

R

RegexRulesPasswordProvider - Class in org.apache.solr.handler.extraction
Password provider for Extracting request handler which finds correct password based on file name matching against a list of regular expressions.
RegexRulesPasswordProvider() - Constructor for class org.apache.solr.handler.extraction.RegexRulesPasswordProvider
 
resetExplicitPassword() - Method in class org.apache.solr.handler.extraction.RegexRulesPasswordProvider
Resets explicit password, so that map will be used for lookups
RESOURCE_NAME - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
Optional.
RESOURCE_PASSWORD - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
Optional.

S

schema - Variable in class org.apache.solr.handler.extraction.SolrContentHandler
 
setExplicitPassword(String) - Method in class org.apache.solr.handler.extraction.RegexRulesPasswordProvider
Sets an explicit password which will be used instead of password map
setPasswordMap(LinkedHashMap<Pattern, String>) - Method in class org.apache.solr.handler.extraction.RegexRulesPasswordProvider
 
SolrContentHandler - Class in org.apache.solr.handler.extraction
The class responsible for handling Tika events and translating them into SolrInputDocuments.
SolrContentHandler(Metadata, SolrParams, IndexSchema) - Constructor for class org.apache.solr.handler.extraction.SolrContentHandler
 
SolrContentHandlerFactory - Class in org.apache.solr.handler.extraction
 
SolrContentHandlerFactory() - Constructor for class org.apache.solr.handler.extraction.SolrContentHandlerFactory
 
startElement(String, String, String, Attributes) - Method in class org.apache.solr.handler.extraction.SolrContentHandler
 
STREAM_CONTENT_TYPE - Static variable in interface org.apache.solr.handler.extraction.ExtractingMetadataConstants
 
STREAM_NAME - Static variable in interface org.apache.solr.handler.extraction.ExtractingMetadataConstants
 
STREAM_SIZE - Static variable in interface org.apache.solr.handler.extraction.ExtractingMetadataConstants
 
STREAM_SOURCE_INFO - Static variable in interface org.apache.solr.handler.extraction.ExtractingMetadataConstants
 
STREAM_TYPE - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
The type of the stream.

T

TEXT_FORMAT - Static variable in class org.apache.solr.handler.extraction.ExtractingDocumentLoader
Extract Only supported format

U

UNKNOWN_FIELD_PREFIX - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
Optional.
unknownFieldPrefix - Variable in class org.apache.solr.handler.extraction.SolrContentHandler
 

W

write(OutputStream, SolrQueryRequest, SolrQueryResponse) - Method in class org.apache.solr.handler.extraction.XLSXResponseWriter
 

X

XLSXResponseWriter - Class in org.apache.solr.handler.extraction
 
XLSXResponseWriter() - Constructor for class org.apache.solr.handler.extraction.XLSXResponseWriter
 
XML_FORMAT - Static variable in class org.apache.solr.handler.extraction.ExtractingDocumentLoader
Extract Only supported format.
XPATH_EXPRESSION - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
Restrict the extracted parts of a document to be indexed by passing in an XPath expression.
A C D E F G I L M N O P R S T U W X 
All Classes All Packages