| ArabicLetterTokenizerFactory |
Deprecated
(3.1) Use StandardTokenizerFactory instead.
|
| ArabicNormalizationFilterFactory |
|
| ArabicStemFilterFactory |
|
| ASCIIFoldingFilterFactory |
|
| BaseCharFilterFactory |
|
| BaseTokenFilterFactory |
Simple abstract implementation that handles init arg processing.
|
| BaseTokenizerFactory |
Simple abstract implementation that handles init arg processing.
|
| BeiderMorseFilterFactory |
|
| BrazilianStemFilterFactory |
|
| BufferedTokenStream |
Deprecated
This class does not support custom attributes.
|
| BulgarianStemFilterFactory |
|
| CapitalizationFilterFactory |
A filter to apply normal capitalization rules to Tokens.
|
| ChineseFilterFactory |
Deprecated
|
| ChineseTokenizerFactory |
Deprecated
|
| CJKBigramFilterFactory |
|
| CJKTokenizerFactory |
Deprecated |
| CJKWidthFilterFactory |
|
| ClassicFilterFactory |
|
| ClassicTokenizerFactory |
|
| CollationKeyFilterFactory |
|
| CommonGramsFilter |
Construct bigrams for frequently occurring terms while indexing.
|
| CommonGramsFilterFactory |
|
| CommonGramsQueryFilter |
Wrap a CommonGramsFilter optimizing phrase queries by only returning single
words when they are not a member of a bigram.
|
| CommonGramsQueryFilterFactory |
|
| CzechStemFilterFactory |
|
| DelimitedPayloadTokenFilterFactory |
|
| DictionaryCompoundWordTokenFilterFactory |
|
| DoubleMetaphoneFilter |
Deprecated
|
| DoubleMetaphoneFilterFactory |
|
| DutchStemFilterFactory |
Deprecated
|
| EdgeNGramFilterFactory |
|
| EdgeNGramTokenizerFactory |
|
| ElisionFilterFactory |
|
| EnglishMinimalStemFilterFactory |
|
| EnglishPorterFilterFactory |
Deprecated
Use SnowballPorterFilterFactory with language="English" instead
|
| EnglishPossessiveFilterFactory |
|
| FinnishLightStemFilterFactory |
|
| FrenchLightStemFilterFactory |
|
| FrenchMinimalStemFilterFactory |
|
| FrenchStemFilterFactory |
Deprecated
|
| GalicianMinimalStemFilterFactory |
|
| GalicianStemFilterFactory |
|
| GermanLightStemFilterFactory |
|
| GermanMinimalStemFilterFactory |
|
| GermanNormalizationFilterFactory |
|
| GermanStemFilterFactory |
|
| GreekLowerCaseFilterFactory |
|
| GreekStemFilterFactory |
|
| HindiNormalizationFilterFactory |
|
| HindiStemFilterFactory |
|
| HTMLStripCharFilterFactory |
|
| HungarianLightStemFilterFactory |
|
| HunspellStemFilterFactory |
|
| HyphenatedWordsFilter |
When the plain text is extracted from documents, we will often have many words hyphenated and broken into
two lines.
|
| HyphenatedWordsFilterFactory |
|
| HyphenationCompoundWordTokenFilterFactory |
|
| ICUCollationKeyFilterFactory |
|
| ICUFoldingFilterFactory |
|
| ICUNormalizer2FilterFactory |
|
| ICUTokenizerFactory |
|
| ICUTransformFilterFactory |
|
| IndicNormalizationFilterFactory |
|
| IndonesianStemFilterFactory |
|
| IrishLowerCaseFilterFactory |
|
| ISOLatin1AccentFilterFactory |
Deprecated
|
| ItalianLightStemFilterFactory |
|
| JapaneseBaseFormFilterFactory |
|
| JapaneseKatakanaStemFilterFactory |
|
| JapanesePartOfSpeechStopFilterFactory |
|
| JapaneseReadingFormFilterFactory |
|
| JapaneseTokenizerFactory |
|
| KeepWordFilter |
A TokenFilter that only keeps tokens with text contained in the
required words.
|
| KeepWordFilterFactory |
|
| KeywordMarkerFilterFactory |
|
| KeywordTokenizerFactory |
|
| KStemFilterFactory |
|
| LatvianStemFilterFactory |
|
| LegacyHTMLStripCharFilter |
Deprecated
|
| LegacyHTMLStripCharFilterFactory |
Deprecated
|
| LengthFilterFactory |
|
| LetterTokenizerFactory |
|
| LimitTokenCountFilterFactory |
|
| LowerCaseFilterFactory |
|
| LowerCaseTokenizerFactory |
|
| MappingCharFilterFactory |
|
| NGramFilterFactory |
|
| NGramTokenizerFactory |
|
| NorwegianLightStemFilterFactory |
|
| NorwegianMinimalStemFilterFactory |
|
| NumericPayloadTokenFilterFactory |
|
| PathHierarchyTokenizerFactory |
|
| PatternReplaceCharFilter |
CharFilter that uses a regular expression for the target of replace string.
|
| PatternReplaceCharFilterFactory |
|
| PatternReplaceFilter |
A TokenFilter which applies a Pattern to each token in the stream,
replacing match occurances with the specified replacement string.
|
| PatternReplaceFilterFactory |
|
| PatternTokenizer |
This tokenizer uses regex pattern matching to construct distinct tokens
for the input stream.
|
| PatternTokenizerFactory |
|
| PersianCharFilterFactory |
|
| PersianNormalizationFilterFactory |
|
| PhoneticFilter |
Deprecated
|
| PhoneticFilterFactory |
|
| PorterStemFilterFactory |
|
| PortugueseLightStemFilterFactory |
|
| PortugueseMinimalStemFilterFactory |
|
| PortugueseStemFilterFactory |
|
| PositionFilterFactory |
|
| RemoveDuplicatesTokenFilter |
A TokenFilter which filters out Tokens at the same position and Term text as the previous token in the stream.
|
| RemoveDuplicatesTokenFilterFactory |
|
| ReversedWildcardFilter |
This class produces a special form of reversed tokens, suitable for
better handling of leading wildcards.
|
| ReversedWildcardFilterFactory |
|
| ReverseStringFilterFactory |
|
| RussianLetterTokenizerFactory |
Deprecated
|
| RussianLightStemFilterFactory |
|
| RussianLowerCaseFilterFactory |
Deprecated
|
| RussianStemFilterFactory |
Deprecated
|
| ShingleFilterFactory |
|
| SmartChineseSentenceTokenizerFactory |
|
| SmartChineseWordTokenFilterFactory |
Factory for the SmartChineseAnalyzer WordTokenFilter
Note: this class will currently emit tokens for punctuation.
|
| SnowballPorterFilterFactory |
Factory for SnowballFilter, with configurable language
Note: Use of the "Lovins" stemmer is not recommended, as it is implemented with reflection.
|
| SolrAnalyzer |
|
| SolrAnalyzer.TokenStreamInfo |
|
| SpanishLightStemFilterFactory |
|
| StandardFilterFactory |
|
| StandardTokenizerFactory |
|
| StemmerOverrideFilterFactory |
|
| StempelPolishStemFilterFactory |
|
| StopFilterFactory |
|
| SwedishLightStemFilterFactory |
|
| SynonymFilterFactory |
|
| ThaiWordFilterFactory |
|
| TokenizerChain |
|
| TokenOffsetPayloadTokenFilterFactory |
|
| TrieTokenizerFactory |
Tokenizer for trie fields.
|
| TrimFilter |
Trims leading and trailing whitespace from Tokens in the stream.
|
| TrimFilterFactory |
|
| TurkishLowerCaseFilterFactory |
|
| TypeAsPayloadTokenFilterFactory |
|
| TypeTokenFilterFactory |
|
| UAX29URLEmailTokenizerFactory |
|
| WhitespaceTokenizerFactory |
|
| WikipediaTokenizerFactory |
|
| WordDelimiterFilterFactory |
Factory for WordDelimiterFilter.
|
| WordDelimiterIterator |
A BreakIterator-like API for iterating over subwords in text, according to WordDelimiterFilter rules.
|