Class BM25SimilarityFactory


  • public class BM25SimilarityFactory
    extends SimilarityFactory
    Factory for BM25Similarity. This is the default similarity since 8.x.

    Parameters:

    • k1 (float): Controls non-linear term frequency normalization (saturation). The default is 1.2
    • b (float): Controls to what degree document length normalizes tf values. The default is 0.75
    • discountOverlaps (bool): True if overlap tokens (tokens with a position of increment of zero) are discounted from the document's length. The default is true
    Since:
    8.0.0
    WARNING: This API is experimental and might change in incompatible ways in the next release.