public class TrieField extends PrimitiveFieldType
LegacyIntField, LegacyLongField, LegacyFloatField and
LegacyDoubleField.
See LegacyNumericRangeQuery for more details.
It supports integer, float, long, double and date types.
For each number being added to this field, multiple terms are generated as per the algorithm described in the above link. The possible number of terms increases dramatically with lower precision steps. For the fast range search to work, trie fields must be indexed.
Trie fields are sortable in numerical order and can be used in function queries.
Note that if you use a precisionStep of 32 for int/float and 64 for long/double/date, then multiple terms will not be generated, range search will be no faster than any other number field, but sorting will still be possible.
LegacyNumericRangeQuery| Modifier and Type | Class and Description |
|---|---|
static class |
TrieField.TrieTypes |
FieldType.DefaultAnalyzer, FieldType.MultiValueSelector| Modifier and Type | Field and Description |
|---|---|
static int |
DEFAULT_PRECISION_STEP |
protected int |
precisionStep |
protected int |
precisionStepArg |
protected TrieField.TrieTypes |
type |
ANALYZER, args, CHAR_FILTER, CHAR_FILTERS, CLASS_NAME, docValuesFormat, falseProperties, FILTER, FILTERS, INDEX, INDEX_ANALYZER, MULTI_TERM, MULTI_TERM_ANALYZER, POLY_FIELD_SEPARATOR, postingsFormat, properties, QUERY, QUERY_ANALYZER, similarity, SIMILARITY, similarityFactory, TOKENIZER, trueProperties, TYPE, TYPE_NAME, typeNameBINARY, DOC_VALUES, INDEXED, MULTIVALUED, OMIT_NORMS, OMIT_POSITIONS, OMIT_TF_POSITIONS, REQUIRED, SORT_MISSING_FIRST, SORT_MISSING_LAST, STORE_OFFSETS, STORE_TERMOFFSETS, STORE_TERMPAYLOADS, STORE_TERMPOSITIONS, STORE_TERMVECTORS, STORED, TOKENIZED, USE_DOCVALUES_AS_STORED| Constructor and Description |
|---|
TrieField() |
| Modifier and Type | Method and Description |
|---|---|
void |
checkSchemaField(SchemaField field)
Check's
SchemaField instances constructed
using this field type to ensure that they are valid. |
IndexableField |
createField(SchemaField field,
Object value,
float boost)
Used for adding a document when a field needs to be created from a
type and a string.
|
List<IndexableField> |
createFields(SchemaField sf,
Object value,
float boost)
Given a
SchemaField, create one or more IndexableField instances |
Query |
getFieldQuery(QParser parser,
SchemaField field,
String externalVal)
Returns a Query instance for doing searches against a field.
|
static String |
getMainValuePrefix(FieldType ft)
expert internal use, subject to change.
|
FieldType.LegacyNumericType |
getNumericType()
Return the numeric type of this field, or null if this field is not a
numeric field.
|
int |
getPrecisionStep() |
Query |
getRangeQuery(QParser parser,
SchemaField field,
String min,
String max,
boolean minInclusive,
boolean maxInclusive)
Returns a Query instance for doing range searches on this field type.
|
ValueSource |
getSingleValueSource(FieldType.MultiValueSelector choice,
SchemaField field,
QParser parser)
Method for dynamically building a ValueSource based on a single value of a multivalued field.
|
protected ValueSource |
getSingleValueSource(SortedSetSelector.Type choice,
SchemaField field)
Helper method that will only be called for multivalued Trie fields that have doc values.
|
SortField |
getSortField(SchemaField field,
boolean top)
Returns the SortField instance that should be used to sort fields
of this type.
|
TrieField.TrieTypes |
getType() |
UninvertingReader.Type |
getUninversionType(SchemaField sf)
DocValues is not enabled for a field, but it's indexed, docvalues can be constructed
on the fly (uninverted, aka fieldcache) on the first request to sort, facet, etc.
|
ValueSource |
getValueSource(SchemaField field,
QParser qparser)
called to get the default value source (normally, from the
Lucene FieldCache.)
|
CharsRef |
indexedToReadable(BytesRef indexedForm,
CharsRefBuilder charsRef)
Given an indexed term, append the human readable representation
|
String |
indexedToReadable(String _indexedForm)
Given an indexed term, return the human readable representation
|
protected void |
init(IndexSchema schema,
Map<String,String> args)
subclasses should initialize themselves with the args provided
and remove valid arguments.
|
boolean |
isTokenized()
Returns true if fields of this type should be tokenized
|
boolean |
multiValuedFieldCache()
Returns true if a single field value of this type has multiple logical values
for the purposes of faceting, sorting, etc.
|
void |
readableToIndexed(CharSequence val,
BytesRefBuilder result)
Given the readable value, return the term value that will match it.
|
String |
readableToIndexed(String val)
Given the readable value, return the term value that will match it.
|
String |
storedToIndexed(IndexableField f)
Given the stored field, return the indexed form
|
String |
storedToReadable(IndexableField f)
Given the stored field, return the human readable representation
|
String |
toExternal(IndexableField f)
Convert the stored-field format to an external (string, human readable)
value
|
String |
toInternal(String val)
Convert an external value (from XML update command or from query string)
into the internal format for both storing and indexing (which can be modified by any analyzers).
|
Object |
toObject(IndexableField f)
Convert the stored-field format to an external object.
|
Object |
toObject(SchemaField sf,
BytesRef term) |
void |
write(TextResponseWriter writer,
String name,
IndexableField f)
calls back to TextResponseWriter to write the field value
|
createField, getAnalyzerProperties, getClassArg, getDocValuesFormat, getIndexAnalyzer, getIndexOptions, getNamedPropertyValues, getNonFieldPropertyArgs, getPostingsFormat, getPrefixQuery, getQueryAnalyzer, getRewriteMethod, getSimilarity, getSimilarityFactory, getStringSort, getTypeName, hasProperty, isExplicitAnalyzer, isExplicitQueryAnalyzer, isMultiValued, isPolyField, marshalBase64SortValue, marshalSortValue, marshalStringSortValue, restrictProps, setArgs, setIndexAnalyzer, setIsExplicitAnalyzer, setIsExplicitQueryAnalyzer, setQueryAnalyzer, setSimilarity, supportsAnalyzers, toNativeType, toString, unmarshalBase64SortValue, unmarshalSortValue, unmarshalStringSortValue, useDocValuesAsStoredpublic static final int DEFAULT_PRECISION_STEP
protected int precisionStepArg
protected int precisionStep
protected TrieField.TrieTypes type
protected void init(IndexSchema schema, Map<String,String> args)
FieldTypeinit in class PrimitiveFieldTypepublic Object toObject(IndexableField f)
FieldTypetoObject in class FieldTypeFieldType.toInternal(java.lang.String)public SortField getSortField(SchemaField field, boolean top)
FieldTypegetSortField in class FieldTypeSchemaField.checkSortability()public UninvertingReader.Type getUninversionType(SchemaField sf)
FieldTypegetUninversionType in class FieldTypesf - field instancenull (to disallow uninversion for the field)public ValueSource getValueSource(SchemaField field, QParser qparser)
FieldTypegetValueSource in class FieldTypepublic final ValueSource getSingleValueSource(FieldType.MultiValueSelector choice, SchemaField field, QParser parser)
FieldTypeSchemaField that is in fact not-multivalued, in which case it delegates to
FieldType.getValueSource(org.apache.solr.schema.SchemaField, org.apache.solr.search.QParser)getSingleValueSource in class FieldTypeFieldType.MultiValueSelectorprotected ValueSource getSingleValueSource(SortedSetSelector.Type choice, SchemaField field)
choice - the selector Type to use, will never be nullfield - the field to use, garunteed to be multivalued.FieldType.getSingleValueSource(MultiValueSelector,SchemaField,QParser)public void write(TextResponseWriter writer, String name, IndexableField f) throws IOException
FieldTypewrite in class FieldTypeIOExceptionpublic boolean isTokenized()
FieldTypeisTokenized in class FieldTypepublic boolean multiValuedFieldCache()
FieldTypemultiValuedFieldCache in class FieldTypepublic int getPrecisionStep()
public TrieField.TrieTypes getType()
public FieldType.LegacyNumericType getNumericType()
FieldTypegetNumericType in class FieldTypepublic Query getRangeQuery(QParser parser, SchemaField field, String min, String max, boolean minInclusive, boolean maxInclusive)
FieldTypeSolrQueryParser
currently passes part1 and part2 as null if they are '*' respectively. minInclusive and maxInclusive are both true
currently by SolrQueryParser but that may change in the future. Also, other QueryParser implementations may have
different semantics.
Sub-classes should override this method to provide their own range query implementation. They should strive to handle nulls in part1 and/or part2 as well as unequal minInclusive and maxInclusive parameters gracefully.
getRangeQuery in class FieldTypeparser - the QParser calling the methodfield - the schema fieldmin - the lower boundary of the range, nulls are allowed.max - the upper boundary of the range, nulls are allowedminInclusive - whether the minimum of the range is inclusive or notmaxInclusive - whether the maximum of the range is inclusive or notpublic Query getFieldQuery(QParser parser, SchemaField field, String externalVal)
FieldTypegetFieldQuery in class FieldTypeparser - The QParser calling the methodfield - The SchemaField of the field to searchexternalVal - The String representation of the value to searchQuery instance. This implementation returns a TermQuery but overriding queries may notpublic String storedToReadable(IndexableField f)
FieldTypestoredToReadable in class FieldTypepublic String readableToIndexed(String val)
FieldTypereadableToIndexed in class FieldTypepublic void readableToIndexed(CharSequence val, BytesRefBuilder result)
FieldTypereadableToIndexed in class FieldTypepublic String toInternal(String val)
FieldTypetoInternal in class FieldTypeFieldType.toExternal(org.apache.lucene.index.IndexableField)public String toExternal(IndexableField f)
FieldTypetoExternal in class FieldTypeFieldType.toInternal(java.lang.String)public String indexedToReadable(String _indexedForm)
FieldTypeindexedToReadable in class FieldTypepublic CharsRef indexedToReadable(BytesRef indexedForm, CharsRefBuilder charsRef)
FieldTypeindexedToReadable in class FieldTypepublic Object toObject(SchemaField sf, BytesRef term)
public String storedToIndexed(IndexableField f)
FieldTypestoredToIndexed in class FieldTypepublic IndexableField createField(SchemaField field, Object value, float boost)
FieldTypeBy default, the indexed value is the same as the stored value (taken from toInternal()). Having a different representation for external, internal, and indexed would present quite a few problems given the current Lucene architecture. An analyzer for adding docs would need to translate internal->indexed while an analyzer for querying would need to translate external->indexed.
The only other alternative to having internal==indexed would be to have internal==external. In this case, toInternal should convert to the indexed representation, toExternal() should do nothing, and createField() should *not* call toInternal, but use the external value and set tokenized=true to get Lucene to convert to the internal(indexed) form.
:TODO: clean up and clarify this explanation.createField in class FieldTypeFieldType.toInternal(java.lang.String)public List<IndexableField> createFields(SchemaField sf, Object value, float boost)
FieldTypeSchemaField, create one or more IndexableField instancescreateFields in class FieldTypesf - the SchemaFieldvalue - The value to add to the fieldboost - The boost to applyIndexableFieldFieldType.createField(SchemaField, Object, float),
FieldType.isPolyField()public static String getMainValuePrefix(FieldType ft)
public void checkSchemaField(SchemaField field)
FieldTypeSchemaField instances constructed
using this field type to ensure that they are valid.
This method is called by the SchemaField constructor to
check that its initialization does not violate any fundemental
requirements of the FieldType. The default implementation
does nothing, but subclasses may chose to throw a SolrException
if invariants are violated by the SchemaField.
checkSchemaField in class FieldTypeCopyright © 2000-2016 Apache Software Foundation. All Rights Reserved.