public interface TokenizerFactory
TokenizerFactory breaks up a stream of characters
into tokens.
TokenizerFactories are registered for FieldTypes with the
IndexSchema through the schema.xml file.
Example schema.xml entry to register a TokenizerFactory
implementation to tokenize fields of type "cool"
<fieldtype name="cool" class="solr.TextField">
<analyzer>
<tokenizer class="solr.StandardTokenizerFactory"/>
...
A single instance of any registered TokenizerFactory is created via the default constructor and is reused for each FieldType.
| Modifier and Type | Method and Description |
|---|---|
Tokenizer |
create(Reader input)
Creates a TokenStream of the specified input
|
Map<String,String> |
getArgs()
Accessor method for reporting the args used to initialize this factory.
|
void |
init(Map<String,String> args)
init will be called just once, immediately after creation. |
void init(Map<String,String> args)
init will be called just once, immediately after creation.
The args are user-level initialization parameters that may be specified when declaring a the factory in the schema.xml
Map<String,String> getArgs()
Implementations are strongly encouraged to return the contents of the Map passed to to the init method