Class BM25SimilarityFactory

  • public class BM25SimilarityFactory
    extends SimilarityFactory
    Factory for BM25Similarity. This is the default similarity since 8.x.


    • k1 (float): Controls non-linear term frequency normalization (saturation). The default is 1.2
    • b (float): Controls to what degree document length normalizes tf values. The default is 0.75
    • discountOverlaps (bool): True if overlap tokens (tokens with a position of increment of zero) are discounted from the document's length. The default is true
    WARNING: This API is experimental and might change in incompatible ways in the next release.