Package-level declarations

Types

Link copied to clipboard
abstract class BloomFilterFactory

Class used to create index-time FuzzySet appropriately configured for each field. Also called to right-size bitsets for serialization.

Link copied to clipboard
class BloomFilteringPostingsFormat(delegatePostingsFormat: PostingsFormat? = null, bloomFilterFactory: BloomFilterFactory = DefaultBloomFilterFactory()) : PostingsFormat

A PostingsFormat useful for low doc-frequency fields such as primary keys. Bloom filters are maintained in a ".blm" file which offers "fast-fail" for reads in segments known to have no record of the key. A choice of delegate PostingsFormat is used to record all other Postings data.

Link copied to clipboard

Default policy is to allocate a bitset with 10% saturation given a unique term per document. Bits are set via MurmurHash2 hashing function.

Link copied to clipboard

A class used to represent a set of many, potentially large, values (e.g. many long strings such as URLs), using a significantly smaller amount of memory.