| Interface | Description |
|---|---|
| CharFilterFactory | |
| MultiTermAwareComponent |
Add to any analysis factory component to allow returning an
analysis component factory for use with partial terms in prefix queries,
wildcard queries, range query endpoints, regex queries, etc.
|
| TokenFilterFactory |
A
TokenFilterFactory creates a
TokenFilter to transform one TokenStream
into another. |
| TokenizerFactory |
A
TokenizerFactory breaks up a stream of characters
into tokens. |
| Class | Description |
|---|---|
| ArabicLetterTokenizerFactory | Deprecated
(3.1) Use StandardTokenizerFactory instead.
|
| ArabicNormalizationFilterFactory |
Factory for
ArabicNormalizationFilter. |
| ArabicStemFilterFactory |
Factory for
ArabicStemFilter. |
| ASCIIFoldingFilterFactory |
Factory for
ASCIIFoldingFilter. |
| BaseCharFilterFactory | |
| BaseTokenFilterFactory |
Simple abstract implementation that handles init arg processing.
|
| BaseTokenizerFactory |
Simple abstract implementation that handles init arg processing.
|
| BeiderMorseFilterFactory |
Factory for
BeiderMorseFilter. |
| BrazilianStemFilterFactory |
Factory for
BrazilianStemFilter. |
| BufferedTokenStream | Deprecated
This class does not support custom attributes.
|
| BulgarianStemFilterFactory |
Factory for
BulgarianStemFilter. |
| CapitalizationFilterFactory |
A filter to apply normal capitalization rules to Tokens.
|
| ChineseFilterFactory | Deprecated
Use
StopFilterFactory instead. |
| ChineseTokenizerFactory | Deprecated
Use
StandardTokenizerFactory instead. |
| CJKBigramFilterFactory |
Factory for
CJKBigramFilter. |
| CJKTokenizerFactory | Deprecated |
| CJKWidthFilterFactory |
Factory for
CJKWidthFilter. |
| ClassicFilterFactory |
Factory for
ClassicFilter. |
| ClassicTokenizerFactory |
Factory for
ClassicTokenizer. |
| CollationKeyFilterFactory |
Factory for
CollationKeyFilter. |
| CommonGramsFilter |
Construct bigrams for frequently occurring terms while indexing.
|
| CommonGramsFilterFactory |
Constructs a
CommonGramsFilter. |
| CommonGramsQueryFilter |
Wrap a CommonGramsFilter optimizing phrase queries by only returning single
words when they are not a member of a bigram.
|
| CommonGramsQueryFilterFactory |
Construct
CommonGramsQueryFilter. |
| CzechStemFilterFactory |
Factory for
CzechStemFilter. |
| DelimitedPayloadTokenFilterFactory |
Factory for
DelimitedPayloadTokenFilter. |
| DictionaryCompoundWordTokenFilterFactory |
Factory for
DictionaryCompoundWordTokenFilter. |
| DoubleMetaphoneFilter | Deprecated
Use
DoubleMetaphoneFilter instead |
| DoubleMetaphoneFilterFactory |
Factory for
DoubleMetaphoneFilter. |
| DutchStemFilterFactory | Deprecated
Use
SnowballPorterFilterFactory with "Dutch" instead,
which has the same functionality. |
| EdgeNGramFilterFactory |
Creates new instances of
EdgeNGramTokenFilter. |
| EdgeNGramTokenizerFactory |
Creates new instances of
EdgeNGramTokenizer. |
| ElisionFilterFactory |
Factory for
ElisionFilter. |
| EnglishMinimalStemFilterFactory |
Factory for
EnglishMinimalStemFilter. |
| EnglishPorterFilterFactory | Deprecated
Use SnowballPorterFilterFactory with language="English" instead
|
| EnglishPossessiveFilterFactory |
Factory for
EnglishPossessiveFilter. |
| FinnishLightStemFilterFactory |
Factory for
FinnishLightStemFilter. |
| FrenchLightStemFilterFactory |
Factory for
FrenchLightStemFilter. |
| FrenchMinimalStemFilterFactory |
Factory for
FrenchMinimalStemFilter. |
| FrenchStemFilterFactory | Deprecated
Use
SnowballPorterFilterFactory with "French" instead,
which has the same functionality. |
| GalicianMinimalStemFilterFactory |
Factory for
GalicianMinimalStemFilter. |
| GalicianStemFilterFactory |
Factory for
GalicianStemFilter. |
| GermanLightStemFilterFactory |
Factory for
GermanLightStemFilter. |
| GermanMinimalStemFilterFactory |
Factory for
GermanMinimalStemFilter. |
| GermanNormalizationFilterFactory |
Factory for
GermanNormalizationFilter. |
| GermanStemFilterFactory |
Factory for
GermanStemFilter. |
| GreekLowerCaseFilterFactory |
Factory for
GreekLowerCaseFilter. |
| GreekStemFilterFactory |
Factory for
GreekStemFilter. |
| HindiNormalizationFilterFactory |
Factory for
HindiNormalizationFilter. |
| HindiStemFilterFactory |
Factory for
HindiStemFilter. |
| HTMLStripCharFilterFactory |
Factory for
HTMLStripCharFilter. |
| HungarianLightStemFilterFactory |
Factory for
HungarianLightStemFilter. |
| HunspellStemFilterFactory |
TokenFilterFactory that creates instances of
HunspellStemFilter. |
| HyphenatedWordsFilter |
When the plain text is extracted from documents, we will often have many words hyphenated and broken into
two lines.
|
| HyphenatedWordsFilterFactory |
Factory for
HyphenatedWordsFilter. |
| HyphenationCompoundWordTokenFilterFactory |
Factory for
HyphenationCompoundWordTokenFilter. |
| ICUCollationKeyFilterFactory |
Factory for
ICUCollationKeyFilter. |
| ICUFoldingFilterFactory |
Factory for
ICUFoldingFilter |
| ICUNormalizer2FilterFactory |
Factory for
ICUNormalizer2Filter
Supports the following attributes:
name: A Unicode Normalization Form,
one of 'nfc','nfkc', 'nfkc_cf'. |
| ICUTokenizerFactory |
Factory for
ICUTokenizer |
| ICUTransformFilterFactory |
Factory for
ICUTransformFilter. |
| IndicNormalizationFilterFactory |
Factory for
IndicNormalizationFilter. |
| IndonesianStemFilterFactory |
Factory for
IndonesianStemFilter. |
| IrishLowerCaseFilterFactory |
Factory for
IrishLowerCaseFilter. |
| ISOLatin1AccentFilterFactory | Deprecated
Use
ASCIIFoldingFilterFactory instead. |
| ItalianLightStemFilterFactory |
Factory for
ItalianLightStemFilter. |
| JapaneseBaseFormFilterFactory |
Factory for
JapaneseBaseFormFilter. |
| JapaneseKatakanaStemFilterFactory |
Factory for
JapaneseKatakanaStemFilterFactory. |
| JapanesePartOfSpeechStopFilterFactory |
Factory for
JapanesePartOfSpeechStopFilter. |
| JapaneseReadingFormFilterFactory |
Factory for
JapaneseReadingFormFilter. |
| JapaneseTokenizerFactory |
Factory for
JapaneseTokenizer. |
| KeepWordFilter |
A TokenFilter that only keeps tokens with text contained in the
required words.
|
| KeepWordFilterFactory |
Factory for
KeepWordFilter. |
| KeywordMarkerFilterFactory |
Factory for
KeywordMarkerFilter. |
| KeywordTokenizerFactory |
Factory for
KeywordTokenizer. |
| KStemFilterFactory |
Factory for
KStemFilter |
| LatvianStemFilterFactory |
Factory for
LatvianStemFilter. |
| LegacyHTMLStripCharFilter | Deprecated
use
HTMLStripCharFilter |
| LegacyHTMLStripCharFilterFactory | Deprecated |
| LengthFilterFactory |
Factory for
LengthFilter. |
| LetterTokenizerFactory |
Factory for
LetterTokenizer. |
| LimitTokenCountFilterFactory |
Factory for
LimitTokenCountFilter. |
| LowerCaseFilterFactory |
Factory for
LowerCaseFilter. |
| LowerCaseTokenizerFactory |
Factory for
LowerCaseTokenizer. |
| MappingCharFilterFactory |
Factory for
MappingCharFilter. |
| NGramFilterFactory |
Factory for
NGramTokenFilter. |
| NGramTokenizerFactory |
Factory for
NGramTokenizer. |
| NorwegianLightStemFilterFactory |
Factory for
NorwegianLightStemFilter. |
| NorwegianMinimalStemFilterFactory |
Factory for
NorwegianMinimalStemFilter. |
| NumericPayloadTokenFilterFactory |
Factory for
NumericPayloadTokenFilter. |
| PathHierarchyTokenizerFactory |
Factory for
PathHierarchyTokenizer. |
| PatternReplaceCharFilter |
CharFilter that uses a regular expression for the target of replace string.
|
| PatternReplaceCharFilterFactory |
Factory for
PatternReplaceCharFilter. |
| PatternReplaceFilter |
A TokenFilter which applies a Pattern to each token in the stream,
replacing match occurances with the specified replacement string.
|
| PatternReplaceFilterFactory |
Factory for
PatternReplaceFilter. |
| PatternTokenizer |
This tokenizer uses regex pattern matching to construct distinct tokens
for the input stream.
|
| PatternTokenizerFactory |
Factory for
PatternTokenizer. |
| PersianCharFilterFactory |
Factory for
PersianCharFilter. |
| PersianNormalizationFilterFactory |
Factory for
PersianNormalizationFilter. |
| PhoneticFilter | Deprecated
Use
PhoneticFilter instead. |
| PhoneticFilterFactory |
Factory for
PhoneticFilter. |
| PorterStemFilterFactory |
Factory for
PorterStemFilter. |
| PortugueseLightStemFilterFactory |
Factory for
PortugueseLightStemFilter. |
| PortugueseMinimalStemFilterFactory |
Factory for
PortugueseMinimalStemFilter. |
| PortugueseStemFilterFactory |
Factory for
PortugueseStemFilter. |
| PositionFilterFactory |
Factory for
PositionFilter. |
| RemoveDuplicatesTokenFilter |
A TokenFilter which filters out Tokens at the same position and Term text as the previous token in the stream.
|
| RemoveDuplicatesTokenFilterFactory |
Factory for
RemoveDuplicatesTokenFilter. |
| ReversedWildcardFilter |
This class produces a special form of reversed tokens, suitable for
better handling of leading wildcards.
|
| ReversedWildcardFilterFactory |
Factory for
ReversedWildcardFilter-s. |
| ReverseStringFilterFactory |
Factory for
ReverseStringFilter. |
| RussianLetterTokenizerFactory | Deprecated
Use
StandardTokenizerFactory instead. |
| RussianLightStemFilterFactory |
Factory for
RussianLightStemFilter. |
| RussianLowerCaseFilterFactory | Deprecated
Use
LowerCaseFilterFactory instead which has the
same functionality. |
| RussianStemFilterFactory | Deprecated
Use
SnowballPorterFilterFactory with "Russian" instead,
which has the same functionality. |
| ShingleFilterFactory |
Factory for
ShingleFilter. |
| SmartChineseSentenceTokenizerFactory |
Factory for the SmartChineseAnalyzer
SentenceTokenizer |
| SmartChineseWordTokenFilterFactory |
Factory for the SmartChineseAnalyzer
WordTokenFilter
Note: this class will currently emit tokens for punctuation. |
| SnowballPorterFilterFactory |
Factory for
SnowballFilter, with configurable language
Note: Use of the "Lovins" stemmer is not recommended, as it is implemented with reflection. |
| SolrAnalyzer | |
| SolrAnalyzer.TokenStreamInfo | |
| SpanishLightStemFilterFactory |
Factory for
SpanishLightStemFilter. |
| StandardFilterFactory |
Factory for
StandardFilter. |
| StandardTokenizerFactory |
Factory for
StandardTokenizer. |
| StemmerOverrideFilterFactory |
Factory for
StemmerOverrideFilter. |
| StempelPolishStemFilterFactory |
Factory for
StempelFilter using a Polish stemming table. |
| StopFilterFactory |
Factory for
StopFilter. |
| SwedishLightStemFilterFactory |
Factory for
SwedishLightStemFilter. |
| SynonymFilterFactory |
Factory for
SynonymFilter. |
| ThaiWordFilterFactory |
Factory for
ThaiWordFilter. |
| TokenizerChain | |
| TokenOffsetPayloadTokenFilterFactory |
Factory for
TokenOffsetPayloadTokenFilter. |
| TrieTokenizerFactory |
Tokenizer for trie fields.
|
| TrimFilter |
Trims leading and trailing whitespace from Tokens in the stream.
|
| TrimFilterFactory |
Factory for
TrimFilter. |
| TurkishLowerCaseFilterFactory |
Factory for
TurkishLowerCaseFilter. |
| TypeAsPayloadTokenFilterFactory |
Factory for
TypeAsPayloadTokenFilter. |
| TypeTokenFilterFactory |
Factory class for
TypeTokenFilter. |
| UAX29URLEmailTokenizerFactory |
Factory for
UAX29URLEmailTokenizer. |
| WhitespaceTokenizerFactory |
Factory for
WhitespaceTokenizer. |
| WikipediaTokenizerFactory |
Factory for
WikipediaTokenizer. |
| WordDelimiterFilterFactory |
Factory for WordDelimiterFilter.
|
| WordDelimiterIterator |
A BreakIterator-like API for iterating over subwords in text, according to WordDelimiterFilter rules.
|