Class ExtractingDocumentLoader
- java.lang.Object
-
- org.apache.solr.handler.loader.ContentStreamLoader
-
- org.apache.solr.handler.extraction.ExtractingDocumentLoader
-
public class ExtractingDocumentLoader extends ContentStreamLoader
The class responsible for loading extracted content into Solr.
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description static class
ExtractingDocumentLoader.MostlyPassthroughHtmlMapper
-
Field Summary
Fields Modifier and Type Field Description protected org.apache.tika.parser.AutoDetectParser
autoDetectParser
protected org.apache.tika.config.TikaConfig
config
protected SolrContentHandlerFactory
factory
protected ParseContextConfig
parseContextConfig
static String
TEXT_FORMAT
Extract Only supported formatstatic String
XML_FORMAT
Extract Only supported format.
-
Constructor Summary
Constructors Constructor Description ExtractingDocumentLoader(SolrQueryRequest req, UpdateRequestProcessor processor, org.apache.tika.config.TikaConfig config, ParseContextConfig parseContextConfig, SolrContentHandlerFactory factory)
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description void
load(SolrQueryRequest req, SolrQueryResponse rsp, org.apache.solr.common.util.ContentStream stream, UpdateRequestProcessor processor)
-
Methods inherited from class org.apache.solr.handler.loader.ContentStreamLoader
getDefaultWT, init
-
-
-
-
Field Detail
-
TEXT_FORMAT
public static final String TEXT_FORMAT
Extract Only supported format- See Also:
- Constant Field Values
-
XML_FORMAT
public static final String XML_FORMAT
Extract Only supported format. Default- See Also:
- Constant Field Values
-
autoDetectParser
protected org.apache.tika.parser.AutoDetectParser autoDetectParser
-
config
protected org.apache.tika.config.TikaConfig config
-
parseContextConfig
protected ParseContextConfig parseContextConfig
-
factory
protected SolrContentHandlerFactory factory
-
-
Constructor Detail
-
ExtractingDocumentLoader
public ExtractingDocumentLoader(SolrQueryRequest req, UpdateRequestProcessor processor, org.apache.tika.config.TikaConfig config, ParseContextConfig parseContextConfig, SolrContentHandlerFactory factory)
-
-
Method Detail
-
load
public void load(SolrQueryRequest req, SolrQueryResponse rsp, org.apache.solr.common.util.ContentStream stream, UpdateRequestProcessor processor) throws Exception
- Specified by:
load
in classContentStreamLoader
- Throws:
Exception
-
-