A C D E F G I L M N O P R S T U W X
All Classes All Packages
All Classes All Packages
All Classes All Packages
A
- addCapturedContent() - Method in class org.apache.solr.handler.extraction.SolrContentHandler
-
Add the per field captured content to the Solr Document.
- addContent() - Method in class org.apache.solr.handler.extraction.SolrContentHandler
-
Add in the catch all content to the field.
- addField(String, String, String[]) - Method in class org.apache.solr.handler.extraction.SolrContentHandler
- addLiterals() - Method in class org.apache.solr.handler.extraction.SolrContentHandler
-
Add in the literals to the document using the
SolrContentHandler.params
and theLITERALS_PREFIX
. - addMetadata() - Method in class org.apache.solr.handler.extraction.SolrContentHandler
-
Add in any metadata using
SolrContentHandler.metadata
as the source. - autoDetectParser - Variable in class org.apache.solr.handler.extraction.ExtractingDocumentLoader
C
- CAPTURE_ATTRIBUTES - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
-
Capture attributes separately according to the name of the element, instead of just adding them to the string buffer
- CAPTURE_ELEMENTS - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
-
Capture the specified fields (and everything included below it that isn't capture by some other capture field) separately from the default.
- captureAttribs - Variable in class org.apache.solr.handler.extraction.SolrContentHandler
- catchAllBuilder - Variable in class org.apache.solr.handler.extraction.SolrContentHandler
- characters(char[], int, int) - Method in class org.apache.solr.handler.extraction.SolrContentHandler
- config - Variable in class org.apache.solr.handler.extraction.ExtractingDocumentLoader
- config - Variable in class org.apache.solr.handler.extraction.ExtractingRequestHandler
- CONFIG_LOCATION - Static variable in class org.apache.solr.handler.extraction.ExtractingRequestHandler
- contentFieldName - Static variable in class org.apache.solr.handler.extraction.SolrContentHandler
- create() - Method in class org.apache.solr.handler.extraction.ParseContextConfig
- createFactory() - Method in class org.apache.solr.handler.extraction.ExtractingRequestHandler
- createSolrContentHandler(Metadata, SolrParams, IndexSchema) - Method in class org.apache.solr.handler.extraction.SolrContentHandlerFactory
D
- DEFAULT_FIELD - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
-
Optional.
- defaultField - Variable in class org.apache.solr.handler.extraction.SolrContentHandler
- document - Variable in class org.apache.solr.handler.extraction.SolrContentHandler
E
- endElement(String, String, String) - Method in class org.apache.solr.handler.extraction.SolrContentHandler
- EXTRACT_FORMAT - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
-
Content output format if extractOnly is true.
- EXTRACT_ONLY - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
-
Only extract and return the content, do not index it.
- ExtractingDocumentLoader - Class in org.apache.solr.handler.extraction
-
The class responsible for loading extracted content into Solr.
- ExtractingDocumentLoader(SolrQueryRequest, UpdateRequestProcessor, TikaConfig, ParseContextConfig, SolrContentHandlerFactory) - Constructor for class org.apache.solr.handler.extraction.ExtractingDocumentLoader
- ExtractingDocumentLoader.MostlyPassthroughHtmlMapper - Class in org.apache.solr.handler.extraction
- ExtractingMetadataConstants - Interface in org.apache.solr.handler.extraction
-
Constants used internally by the
ExtractingRequestHandler
. - ExtractingParams - Interface in org.apache.solr.handler.extraction
-
The various Solr Parameters names to use when extracting content.
- ExtractingRequestHandler - Class in org.apache.solr.handler.extraction
-
Handler for rich documents like PDF or Word or any other file format that Tika handles that need the text to be extracted first from the document.
- ExtractingRequestHandler() - Constructor for class org.apache.solr.handler.extraction.ExtractingRequestHandler
F
- factory - Variable in class org.apache.solr.handler.extraction.ExtractingDocumentLoader
- factory - Variable in class org.apache.solr.handler.extraction.ExtractingRequestHandler
- fieldBuilders - Variable in class org.apache.solr.handler.extraction.SolrContentHandler
- findMappedName(String) - Method in class org.apache.solr.handler.extraction.SolrContentHandler
-
Get the name mapping
G
- getContentType(SolrQueryRequest, SolrQueryResponse) - Method in class org.apache.solr.handler.extraction.XLSXResponseWriter
- getDescription() - Method in class org.apache.solr.handler.extraction.ExtractingRequestHandler
- getExplicitPassword() - Method in class org.apache.solr.handler.extraction.RegexRulesPasswordProvider
-
Gets the explicit password, if set
- getPassword(Metadata) - Method in class org.apache.solr.handler.extraction.RegexRulesPasswordProvider
- getPasswordMap() - Method in class org.apache.solr.handler.extraction.RegexRulesPasswordProvider
- getPermissionName(AuthorizationContext) - Method in class org.apache.solr.handler.extraction.ExtractingRequestHandler
I
- ignorableWhitespace(char[], int, int) - Method in class org.apache.solr.handler.extraction.SolrContentHandler
-
Treat the same as any other characters
- IGNORE_TIKA_EXCEPTION - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
-
if true, ignore TikaException (give up to extract text but index meta data)
- inform(SolrCore) - Method in class org.apache.solr.handler.extraction.ExtractingRequestHandler
- INSTANCE - Static variable in class org.apache.solr.handler.extraction.ExtractingDocumentLoader.MostlyPassthroughHtmlMapper
- isDiscardElement(String) - Method in class org.apache.solr.handler.extraction.ExtractingDocumentLoader.MostlyPassthroughHtmlMapper
-
Keep all elements and their content.
L
- LITERALS_OVERRIDE - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
-
Literal field values will by default override other values such as metadata and content.
- LITERALS_PREFIX - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
-
Pass in literal values to be added to the document, as in
- load(SolrQueryRequest, SolrQueryResponse, ContentStream, UpdateRequestProcessor) - Method in class org.apache.solr.handler.extraction.ExtractingDocumentLoader
- lowerNames - Variable in class org.apache.solr.handler.extraction.SolrContentHandler
- LOWERNAMES - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
-
Map all generated attribute names to field names with lowercase and underscores.
M
- MAP_PREFIX - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
-
The param prefix for mapping Tika metadata to Solr fields.
- mapSafeAttribute(String, String) - Method in class org.apache.solr.handler.extraction.ExtractingDocumentLoader.MostlyPassthroughHtmlMapper
-
Lowercases the attribute name
- mapSafeElement(String) - Method in class org.apache.solr.handler.extraction.ExtractingDocumentLoader.MostlyPassthroughHtmlMapper
-
Lowercases the element name, but returns null for <BR>, which suppresses the start-element event for lt;BR> tags.
- metadata - Variable in class org.apache.solr.handler.extraction.SolrContentHandler
- MostlyPassthroughHtmlMapper() - Constructor for class org.apache.solr.handler.extraction.ExtractingDocumentLoader.MostlyPassthroughHtmlMapper
N
- newDocument() - Method in class org.apache.solr.handler.extraction.SolrContentHandler
-
This is called by a consumer when it is ready to deal with a new SolrInputDocument.
- newLoader(SolrQueryRequest, UpdateRequestProcessor) - Method in class org.apache.solr.handler.extraction.ExtractingRequestHandler
O
- org.apache.solr.handler.extraction - package org.apache.solr.handler.extraction
-
ExtractingRequestHandler
and related code.
P
- params - Variable in class org.apache.solr.handler.extraction.SolrContentHandler
- parse(InputStream) - Method in class org.apache.solr.handler.extraction.RegexRulesPasswordProvider
-
Initialize rules through file input stream.
- PARSE_CONTEXT_CONFIG - Static variable in class org.apache.solr.handler.extraction.ExtractingRequestHandler
- parseContextConfig - Variable in class org.apache.solr.handler.extraction.ExtractingDocumentLoader
- parseContextConfig - Variable in class org.apache.solr.handler.extraction.ExtractingRequestHandler
- ParseContextConfig - Class in org.apache.solr.handler.extraction
- ParseContextConfig() - Constructor for class org.apache.solr.handler.extraction.ParseContextConfig
-
Creates an empty Config without any settings (used as placeholder).
- ParseContextConfig(SolrResourceLoader, String) - Constructor for class org.apache.solr.handler.extraction.ParseContextConfig
-
Creates a
ParseContextConfig
from the given XML file, loaded from the givenSolrResourceLoader
. - ParseContextConfig(SolrResourceLoader, Element) - Constructor for class org.apache.solr.handler.extraction.ParseContextConfig
-
Creates a
ParseContextConfig
from the given XML DOM element. - parseRulesFile(InputStream) - Static method in class org.apache.solr.handler.extraction.RegexRulesPasswordProvider
-
Parses rule file from stream and returns a Map of all rules found
- PASSWORD_MAP_FILE - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
-
Optional.
R
- RegexRulesPasswordProvider - Class in org.apache.solr.handler.extraction
-
Password provider for Extracting request handler which finds correct password based on file name matching against a list of regular expressions.
- RegexRulesPasswordProvider() - Constructor for class org.apache.solr.handler.extraction.RegexRulesPasswordProvider
- resetExplicitPassword() - Method in class org.apache.solr.handler.extraction.RegexRulesPasswordProvider
-
Resets explicit password, so that map will be used for lookups
- RESOURCE_NAME - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
-
Optional.
- RESOURCE_PASSWORD - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
-
Optional.
S
- schema - Variable in class org.apache.solr.handler.extraction.SolrContentHandler
- setExplicitPassword(String) - Method in class org.apache.solr.handler.extraction.RegexRulesPasswordProvider
-
Sets an explicit password which will be used instead of password map
- setPasswordMap(LinkedHashMap<Pattern, String>) - Method in class org.apache.solr.handler.extraction.RegexRulesPasswordProvider
- SolrContentHandler - Class in org.apache.solr.handler.extraction
-
The class responsible for handling Tika events and translating them into
SolrInputDocument
s. - SolrContentHandler(Metadata, SolrParams, IndexSchema) - Constructor for class org.apache.solr.handler.extraction.SolrContentHandler
- SolrContentHandlerFactory - Class in org.apache.solr.handler.extraction
- SolrContentHandlerFactory() - Constructor for class org.apache.solr.handler.extraction.SolrContentHandlerFactory
- startElement(String, String, String, Attributes) - Method in class org.apache.solr.handler.extraction.SolrContentHandler
- STREAM_CONTENT_TYPE - Static variable in interface org.apache.solr.handler.extraction.ExtractingMetadataConstants
- STREAM_NAME - Static variable in interface org.apache.solr.handler.extraction.ExtractingMetadataConstants
- STREAM_SIZE - Static variable in interface org.apache.solr.handler.extraction.ExtractingMetadataConstants
- STREAM_SOURCE_INFO - Static variable in interface org.apache.solr.handler.extraction.ExtractingMetadataConstants
- STREAM_TYPE - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
-
The type of the stream.
T
- TEXT_FORMAT - Static variable in class org.apache.solr.handler.extraction.ExtractingDocumentLoader
-
Extract Only supported format
U
- UNKNOWN_FIELD_PREFIX - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
-
Optional.
- unknownFieldPrefix - Variable in class org.apache.solr.handler.extraction.SolrContentHandler
W
- write(OutputStream, SolrQueryRequest, SolrQueryResponse) - Method in class org.apache.solr.handler.extraction.XLSXResponseWriter
X
- XLSXResponseWriter - Class in org.apache.solr.handler.extraction
- XLSXResponseWriter() - Constructor for class org.apache.solr.handler.extraction.XLSXResponseWriter
- XML_FORMAT - Static variable in class org.apache.solr.handler.extraction.ExtractingDocumentLoader
-
Extract Only supported format.
- XPATH_EXPRESSION - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
-
Restrict the extracted parts of a document to be indexed by passing in an XPath expression.
All Classes All Packages