core/org.gnit.lucenekmp.index

Package-level declarations

Types

AutomatonTermsEnum

open class AutomatonTermsEnum(tenum: TermsEnum?, compiled: CompiledAutomaton) : FilteredTermsEnum

A FilteredTermsEnum that enumerates terms based upon what is accepted by a DFA.

BaseCompositeReader

abstract class BaseCompositeReader<R : IndexReader> : CompositeReader

Base class for implementing CompositeReaders based on an array of sub-readers. The implementing class has to add code for correctly refcounting and closing the sub-readers.

BaseTermsEnum

abstract class BaseTermsEnum : TermsEnum

A base TermsEnum that adds default implementations for several TermsEnum methods.

BinaryDocValues

abstract class BinaryDocValues : DocValuesIterator

A per-document binary value.

BufferedUpdates

class BufferedUpdates(val segmentName: String) : Accountable

Holds buffered deletes and updates, by docID, term or query, for a single segment. This is used to hold buffered pending deletes and updates against the to-be-flushed segment. Once the deletes and updates are pushed (on flush in DocumentsWriter), they are converted to a FrozenBufferedUpdates instance and pushed to the BufferedUpdatesStream.

BufferedUpdatesStream

class BufferedUpdatesStream(infoStream: InfoStream) : Accountable

Tracks the stream of FrozenBufferedUpdates. When DocumentsWriterPerThread flushes, its buffered deletes and updates are appended to this stream and immediately resolved (to actual docIDs, per segment) using the indexing thread that triggered the flush for concurrency. When a merge kicks off, we sync to ensure all resolving packets complete. We also apply to all segments when NRT reader is pulled, commit/close is called, or when too many deletes or updates are buffered and must be flushed (by RAM usage or by count).

ByteSliceReader

class ByteSliceReader : DataInput

DataInput that knows how to read the byte slices written by Posting and PostingVector. We read the bytes in each slice until we hit the end of that slice, at which point we read the forwarding address of the next slice and then jump to it.

ByteVectorValues

abstract class ByteVectorValues : KnnVectorValues

This class provides access to per-document byte vector values indexed as KnnByteVectorField.

CheckIndex

class CheckIndex(dir: Directory, writeLock: Lock = dir.obtainLock(IndexWriter.WRITE_LOCK_NAME)) : AutoCloseable

Basic tool and API to check the health of an index and write a new segments file that removes reference to problematic segments.

CodecReader

abstract class CodecReader : LeafReader

LeafReader implemented by codec APIs.

CompositeReader

abstract class CompositeReader : IndexReader

Instances of this reader type can only be used to get stored fields from the underlying LeafReaders, but it is not possible to directly retrieve postings. To do that, get the LeafReaderContext for all sub-readers via .leaves.

CompositeReaderContext

class CompositeReaderContext : IndexReaderContext

IndexReaderContext for CompositeReader instance.

ConcurrentMergeScheduler

open class ConcurrentMergeScheduler : MergeScheduler

A MergeScheduler that runs each merge using a separate thread.

CorruptIndexException

class CorruptIndexException(originalMessage: String, val resourceDescription: String, cause: Throwable? = null) : IOException

This exception is thrown when Lucene detects an inconsistency in the index.

DirectoryReader

abstract class DirectoryReader : BaseCompositeReader<LeafReader>

DirectoryReader is an implementation of CompositeReader that can read indexes in a Directory.

DocIDMerger

abstract class DocIDMerger<T : DocIDMerger.Sub>

Utility class to help merging documents from sub-readers according to either simple concatenated (unsorted) order, or by a specified index-time sort, skipping deleted documents and remapping non-deleted documents.
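
The simple concatenated (unsorted) case described above can be illustrated with a minimal, self-contained sketch. It does not use the DocIDMerger API; the function name remapDocIds and the representation of each sub-reader as a BooleanArray of live-docs flags are assumptions made only for illustration.

```kotlin
// Hypothetical sketch of concatenated (unsorted) merging. Each sub-reader is
// represented only by its live-docs flags; the result maps
// (subReaderIndex, oldDocId) to the document ID in the merged segment.
fun remapDocIds(liveDocs: List<BooleanArray>): Map<Pair<Int, Int>, Int> {
    val mapping = mutableMapOf<Pair<Int, Int>, Int>()
    var nextDocId = 0
    liveDocs.forEachIndexed { subIndex, live ->
        live.forEachIndexed { oldDocId, isLive ->
            if (isLive) {
                // Deleted documents are skipped and never receive a new ID.
                mapping[subIndex to oldDocId] = nextDocId++
            }
        }
    }
    return mapping
}
```

Surviving documents keep their relative order, and new IDs are dense because deleted documents consume no slot.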

DocsWithFieldSet

class DocsWithFieldSet : DocIdSet

Accumulator for documents that have a value for a field. This is optimized for the case that all documents have a value.

DocumentsWriter

class DocumentsWriter(flushNotifications: DocumentsWriter.FlushNotifications, indexCreatedVersionMajor: Int, pendingNumDocs: AtomicLong, enableTestPoints: Boolean, segmentNameSupplier: () -> String, config: LiveIndexWriterConfig, directoryOrig: Directory, directory: Directory, globalFieldNumberMap: FieldInfos.FieldNumbers) : AutoCloseable, Accountable

This class accepts multiple added documents and directly writes segment files.

DocumentsWriterDeleteQueue

class DocumentsWriterDeleteQueue : Accountable, AutoCloseable

DocumentsWriterDeleteQueue is a non-blocking linked pending deletes queue. In contrast to other queue implementations, we only maintain the tail of the queue. A delete queue is always used in the context of a set of DWPTs and a global delete pool. Each DWPT and the global pool need to maintain their 'own' head of the queue (as a DeleteSlice instance per DocumentsWriterPerThread). The difference between the DWPT and the global pool is that the DWPT starts maintaining a head once it has added its first document, since for its segment-private deletes only the deletes after that document are relevant. The global pool instead starts maintaining the head once this instance is created, by taking the sentinel instance as its initial head.

DocumentsWriterFlushControl

class DocumentsWriterFlushControl(documentsWriter: DocumentsWriter, config: LiveIndexWriterConfig) : Accountable, AutoCloseable

This class controls DocumentsWriterPerThread flushing during indexing. It tracks the memory consumption per DocumentsWriterPerThread and uses a configured FlushPolicy to decide if a DocumentsWriterPerThread must flush.

DocumentsWriterFlushQueue

class DocumentsWriterFlushQueue

DocumentsWriterPerThread

class DocumentsWriterPerThread(indexMajorVersionCreated: Int, segmentName: String, directoryOrig: Directory, directory: Directory, indexWriterConfig: LiveIndexWriterConfig, val deleteQueue: DocumentsWriterDeleteQueue, fieldInfos: FieldInfos.Builder, pendingNumDocs: AtomicLong, enableTestPoints: Boolean) : Accountable, Lock

DocumentsWriterPerThreadPool

class DocumentsWriterPerThreadPool(dwptFactory: () -> DocumentsWriterPerThread) : Iterable<DocumentsWriterPerThread> , AutoCloseable

DocumentsWriterPerThreadPool controls DocumentsWriterPerThread instances and their thread assignments during indexing. Each DocumentsWriterPerThread is, once obtained from the pool, exclusively used for indexing a single document or list of documents by the obtaining thread. Each indexing thread must obtain such a DocumentsWriterPerThread to make progress. Depending on the DocumentsWriterPerThreadPool implementation, DocumentsWriterPerThread assignments might differ from document to document.

DocumentsWriterStallControl

class DocumentsWriterStallControl

Controls the health status of a DocumentsWriter session. This class is used to block incoming indexing threads if flushing is significantly slower than indexing, to ensure the DocumentsWriter's health. If flushing is significantly slower than indexing, the net memory used within an IndexWriter session can increase very quickly and easily exceed the JVM's available memory.

DocValues

object DocValues

This class contains utility methods and constants for DocValues.

DocValuesFieldUpdates

abstract class DocValuesFieldUpdates : Accountable

Holds updates of a single DocValues field, for a set of documents within one segment.

DocValuesIterator

abstract class DocValuesIterator : DocIdSetIterator

DocValuesSkipIndexType

enum DocValuesSkipIndexType : Enum<DocValuesSkipIndexType>

Options for skip indexes on doc values.

DocValuesSkipper

abstract class DocValuesSkipper

Skipper for DocValues.

DocValuesType

enum DocValuesType : Enum<DocValuesType>

DocValues types. Note that DocValues is strongly typed, so a field cannot have different types across different documents.

DocValuesUpdate

abstract class DocValuesUpdate

An in-place update to a DocValues field.

EmptyDocValuesProducer

abstract class EmptyDocValuesProducer : DocValuesProducer

Abstract base class implementing a DocValuesProducer that has no doc values.

ExitableDirectoryReader

class ExitableDirectoryReader(in: DirectoryReader, queryTimeout: QueryTimeout) : FilterDirectoryReader

The ExitableDirectoryReader wraps a real index DirectoryReader and allows for a QueryTimeout implementation object to be checked periodically to see if the thread should exit or not. If QueryTimeout.shouldExit returns true, an ExitingReaderException is thrown.

FieldInfo

class FieldInfo(val name: String, val number: Int, storeTermVector: Boolean, omitNorms: Boolean, storePayloads: Boolean, indexOptions: IndexOptions, docValues: DocValuesType, docValuesSkipIndex: DocValuesSkipIndexType, dvGen: Long, attributes: Map<String, String>, pointDimensionCount: Int, pointIndexDimensionCount: Int, pointNumBytes: Int, vectorDimension: Int, vectorEncoding: VectorEncoding, vectorSimilarityFunction: VectorSimilarityFunction, softDeletesField: Boolean, isParentField: Boolean)

Access to the Field Info file that describes document fields and whether or not they are indexed. Each segment has a separate Field Info file. Objects of this class are thread-safe for multiple readers, but only one thread can be adding documents at a time, with no other reader or writer threads accessing this object.

FieldInfos

open class FieldInfos(infos: Array<FieldInfo>) : Iterable<FieldInfo>

Collection of FieldInfo objects (accessible by number or by name).

FieldInvertState

class FieldInvertState(val indexCreatedVersionMajor: Int, val name: String?, val indexOptions: IndexOptions?)

This class tracks the number and position / offset parameters of terms being added to the index. The information collected in this class is also used to calculate the normalization factor for a field.

Fields

abstract class Fields : Iterable<String>

Provides a Terms index for fields that have it, and lists which fields do. This is primarily an internal/experimental API (see FieldsProducer), although it is also used to expose the set of term vectors per document.

FieldTermIterator

abstract class FieldTermIterator : BytesRefIterator

Iterates over terms across multiple fields. The caller must check .field after each .next to see if the field changed, but == can be used since the iterator implementation ensures it will use the same String instance for a given field.

FieldUpdatesBuffer

class FieldUpdatesBuffer

This class efficiently buffers numeric and binary field updates and stores terms, values and metadata in a memory-efficient way without creating large amounts of objects. Update terms are stored without de-duplicating the update term. In general we try to optimize for several use cases. For instance, we try to use constant space for the update terms field, since the common case always updates on the same field. Also, for docUpTo we try to optimize for the case when updates should be applied to all docs, i.e. docUpTo=Integer.MAX_VALUE. In other cases each update will likely have a different docUpTo. Along the same lines, this implementation optimizes the case when all updates have a value. Lastly, if all updates share the same value for a numeric field, we only store the value once.

FilterBinaryDocValues

abstract class FilterBinaryDocValues : BinaryDocValues

Delegates all methods to a wrapped BinaryDocValues.

FilterCodecReader

abstract class FilterCodecReader(in: CodecReader) : CodecReader

A FilterCodecReader contains another CodecReader, which it uses as its basic source of data, possibly transforming the data along the way or providing additional functionality.

FilterDirectoryReader

abstract class FilterDirectoryReader(in: DirectoryReader, wrapper: FilterDirectoryReader.SubReaderWrapper) : DirectoryReader

A FilterDirectoryReader wraps another DirectoryReader, allowing implementations to transform or extend it.

FilteredTermsEnum

abstract class FilteredTermsEnum : TermsEnum

Abstract class for enumerating a subset of all terms.

FilterLeafReader

abstract class FilterLeafReader : LeafReader

A FilterLeafReader contains another LeafReader, which it uses as its basic source of data, possibly transforming the data along the way or providing additional functionality. The class FilterLeafReader itself simply implements all abstract methods of IndexReader with versions that pass all requests to the contained index reader. Subclasses of FilterLeafReader may further override some of these methods and may also provide additional methods and fields.

FilterMergePolicy

open class FilterMergePolicy(in: MergePolicy) : MergePolicy, Unwrappable<MergePolicy>

A wrapper for MergePolicy instances.

FilterNumericDocValues

abstract class FilterNumericDocValues : NumericDocValues

Delegates all methods to a wrapped NumericDocValues.

FilterSortedDocValues

abstract class FilterSortedDocValues(in: SortedDocValues) : SortedDocValues

Delegates all methods to a wrapped SortedDocValues.

FilterSortedNumericDocValues

abstract class FilterSortedNumericDocValues(in: SortedNumericDocValues) : SortedNumericDocValues

Delegates all methods to a wrapped SortedNumericDocValues.

FilterSortedSetDocValues

open class FilterSortedSetDocValues(in: SortedSetDocValues) : SortedSetDocValues

Delegates all methods to a wrapped SortedSetDocValues.

FloatVectorValues

abstract class FloatVectorValues : KnnVectorValues

This class provides access to per-document floating point vector values indexed as KnnFloatVectorField.

FlushPolicy

abstract class FlushPolicy

FlushPolicy controls when segments are flushed from a RAM-resident internal data structure to the IndexWriter's Directory.

FreqProxTermsWriter

class FreqProxTermsWriter(intBlockAllocator: IntBlockPool.Allocator, byteBlockAllocator: ByteBlockPool.Allocator, bytesUsed: Counter, termVectors: TermsHash) : TermsHash

FrozenBufferedUpdates

class FrozenBufferedUpdates(infoStream: InfoStream, updates: BufferedUpdates, val privateSegment: SegmentCommitInfo?)

Holds buffered deletes and updates by term or query, once pushed. Pushed deletes/updates are write-once, so we shift to a more memory-efficient data structure to hold them. We don't hold docIDs because these are applied on flush.

Impact

class Impact(var freq: Int, var norm: Long)

Per-document scoring factors.

Impacts

abstract class Impacts

Information about upcoming impacts, i.e. (freq, norm) pairs.

ImpactsEnum

abstract class ImpactsEnum : PostingsEnum, ImpactsSource

Extension of PostingsEnum which also provides information about upcoming impacts.

ImpactsSource

interface ImpactsSource

Source of Impacts.

IndexableField

interface IndexableField

Represents a single field for indexing. IndexWriter consumes Iterable<IndexableField> as a document.

IndexableFieldType

interface IndexableFieldType

Describes the properties of a field.

IndexCommit

abstract class IndexCommit : Comparable<IndexCommit?>

Expert: represents a single commit into an index as seen by the IndexDeletionPolicy or IndexReader.

IndexDeletionPolicy

abstract class IndexDeletionPolicy

Expert: policy for deletion of stale index commits.

IndexFileNames

object IndexFileNames

This class contains useful constants representing filenames and extensions used by Lucene, as well as convenience methods for querying whether a file name matches an extension (.matchesExtension) and for generating file names from a segment name, generation and extension (.fileNameFromGeneration, .segmentFileName).

IndexFormatTooNewException

class IndexFormatTooNewException(val resourceDescription: String, val version: Int, val minVersion: Int, val maxVersion: Int) : IOException

This exception is thrown when Lucene detects an index that is newer than this Lucene version.

IndexFormatTooOldException

class IndexFormatTooOldException : IOException

This exception is thrown when Lucene detects an index that is too old for this Lucene version.

IndexingChain

class IndexingChain(indexCreatedVersionMajor: Int, segmentInfo: SegmentInfo, directory: Directory, fieldInfos: FieldInfos.Builder, indexWriterConfig: LiveIndexWriterConfig, abortingExceptionConsumer: (Throwable) -> Unit) : Accountable

Default general purpose indexing chain, which handles indexing of all types of fields.

IndexNotFoundException

class IndexNotFoundException(msg: String?) : IOException

Signals that no index was found in the Directory. This is possibly because the directory is empty, but it can also indicate index corruption.

IndexOptions

enum IndexOptions : Enum<IndexOptions>

Controls how much information is stored in the postings lists.

IndexReader

abstract class IndexReader : AutoCloseable

IndexReader is an abstract class, providing an interface for accessing a point-in-time view of an index. Any changes made to the index via IndexWriter will not be visible until a new IndexReader is opened. It's best to use DirectoryReader.open to obtain an IndexReader, if your IndexWriter is in-process. When you need to re-open to see changes to the index, it's best to use DirectoryReader.openIfChanged, since the new reader will share resources with the previous one when possible. Search of an index is done entirely through this abstract interface, so that any subclass which implements it is searchable.

IndexReaderContext

abstract class IndexReaderContext

A struct-like class that represents a hierarchical relationship between IndexReader instances.

IndexSorter

interface IndexSorter

Handles how documents should be sorted in an index, both within a segment and between segments.

IndexWriter

open class IndexWriter(d: Directory, conf: IndexWriterConfig) : AutoCloseable, TwoPhaseCommit, Accountable, MergePolicy.MergeContext

An IndexWriter creates and maintains an index.

IndexWriterConfig

class IndexWriterConfig(analyzer: Analyzer = StandardAnalyzer()) : LiveIndexWriterConfig

Holds all the configuration that is used to create an IndexWriter. Once an IndexWriter has been created with this object, changes to this object will not affect the IndexWriter instance. For that, use LiveIndexWriterConfig that is returned from IndexWriter.getConfig.

IndexWriterEventListener

interface IndexWriterEventListener

A callback event listener for recording key events that happened inside IndexWriter.

KeepOnlyLastCommitDeletionPolicy

class KeepOnlyLastCommitDeletionPolicy : IndexDeletionPolicy

An IndexDeletionPolicy implementation that keeps only the most recent commit and immediately removes all prior commits after a new commit is done. This is the default deletion policy.
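
The "keep only the most recent commit" rule can be sketched without the real IndexDeletionPolicy or IndexCommit classes. In the sketch below, CommitPoint and keepOnlyLast are hypothetical stand-ins, and the assumption that commits arrive ordered from oldest to newest is stated here rather than taken from this listing.

```kotlin
// Hypothetical stand-in for a commit point; the real IndexCommit API may differ.
class CommitPoint(val generation: Long) {
    var deleted = false
        private set

    fun delete() {
        deleted = true
    }
}

// Marks every commit except the last one as deleted.
// Assumes the list is ordered from oldest to newest commit.
fun keepOnlyLast(commits: List<CommitPoint>) {
    commits.dropLast(1).forEach { it.delete() }
}
```

Calling keepOnlyLast on three commits marks the first two as deleted and leaves the newest untouched; an empty list is a no-op.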

KnnVectorValues

abstract class KnnVectorValues

This class abstracts addressing of document vector values indexed as KnnFloatVectorField or KnnByteVectorField.

LeafMetaData

class LeafMetaData(val createdVersionMajor: Int, val minVersion: Version?, val sort: Sort?, val hasBlocks: Boolean)

Provides read-only metadata about a leaf.

LeafReader

abstract class LeafReader : IndexReader

LeafReader is an abstract class, providing an interface for accessing an index. Search of an index is done entirely through this abstract interface, so that any subclass which implements it is searchable. IndexReaders implemented by this subclass do not consist of several sub-readers, they are atomic. They support retrieval of stored fields, doc values, terms, and postings.

LeafReaderContext

class LeafReaderContext : IndexReaderContext

IndexReaderContext for LeafReader instances.

LiveIndexWriterConfig

open class LiveIndexWriterConfig

Holds all the configuration used by IndexWriter with a few setters for settings that can be changed on an IndexWriter instance "live".

LogByteSizeMergePolicy

class LogByteSizeMergePolicy : LogMergePolicy

This is a LogMergePolicy that measures size of a segment as the total byte size of the segment's files.

LogDocMergePolicy

class LogDocMergePolicy : LogMergePolicy

This is a LogMergePolicy that measures size of a segment as the number of documents (not taking deletions into account).

LogMergePolicy

abstract class LogMergePolicy : MergePolicy

This class implements a MergePolicy that tries to merge segments into levels of exponentially increasing size, where each level has fewer segments than the value of the merge factor. Whenever extra segments (beyond the merge factor upper bound) are encountered, all segments within the level are merged. You can get or set the merge factor using .getMergeFactor and .setMergeFactor respectively.

MappedMultiFields

class MappedMultiFields(val mergeState: MergeState, multiFields: MultiFields) : FilterLeafReader.FilterFields

A Fields implementation that merges multiple Fields into one, and maps around deleted documents. This is used for merging.

MergePolicy

abstract class MergePolicy

Expert: a MergePolicy determines the sequence of primitive merge operations.

MergeRateLimiter

class MergeRateLimiter(mergeProgress: MergePolicy.OneMergeProgress) : RateLimiter

This is the RateLimiter that IndexWriter assigns to each running merge, to give MergeSchedulers ionice-like control.

MergeScheduler

abstract class MergeScheduler : AutoCloseable

Expert: IndexWriter uses an instance implementing this interface to execute the merges selected by a MergePolicy. The default MergeScheduler is ConcurrentMergeScheduler.

MergeState

class MergeState

Holds common state used during segment merging.

MergeTrigger

enum MergeTrigger : Enum<MergeTrigger>

MergeTrigger is passed to MergePolicy.findMerges to indicate the event that triggered the merge.

MultiBits

class MultiBits : Bits

Concatenates multiple Bits together, on every lookup.

MultiDocValues

object MultiDocValues

A wrapper for CompositeReader providing access to DocValues.

MultiFields

class MultiFields(subs: Array<Fields>, subSlices: Array<ReaderSlice>) : Fields

Provides a single Fields term index view over an IndexReader. This is useful when you're interacting with an IndexReader implementation that consists of sequential sub-readers (e.g. DirectoryReader or MultiReader) and you must treat it as a LeafReader.

MultiPostingsEnum

class MultiPostingsEnum(parent: MultiTermsEnum, subReaderCount: Int) : PostingsEnum

Exposes PostingsEnum, merged from PostingsEnum API of sub-segments.

MultiReader

open class MultiReader(subReaders: Array<out IndexReader>, subReadersSorter: Comparator<IndexReader>?, closeSubReaders: Boolean) : BaseCompositeReader<IndexReader>

A CompositeReader which reads multiple indexes, appending their content. It can be used to create a view on several sub-readers (like DirectoryReader) and execute searches on it.

MultiTerms

class MultiTerms(val subTerms: Array<Terms>, val subSlices: Array<ReaderSlice>) : Terms

Exposes flex API, merged from flex API of sub-segments.

MultiTermsEnum

class MultiTermsEnum(slices: Array<ReaderSlice>) : BaseTermsEnum

Exposes TermsEnum API, merged from TermsEnum API of sub-segments. This does a merge sort, by term text, of the sub-readers.
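
The merge described above can be illustrated with a small standalone sketch. It is not the MultiTermsEnum implementation: mergeTerms is a hypothetical function, terms are plain Strings compared with String ordering rather than byte-wise term comparison, and a linear scan over the sub-reader heads replaces whatever queue the real class uses.

```kotlin
// Hypothetical sketch: merges term lists that are each already sorted and
// duplicate-free. Every distinct term is returned once, together with the
// indices of the sub-readers that contain it.
fun mergeTerms(subTerms: List<List<String>>): List<Pair<String, List<Int>>> {
    val positions = IntArray(subTerms.size) // next unread position per sub-reader
    val merged = mutableListOf<Pair<String, List<Int>>>()
    while (true) {
        // Smallest term among the current heads, or null once all are exhausted.
        val smallest = subTerms.indices
            .filter { positions[it] < subTerms[it].size }
            .minOfOrNull { subTerms[it][positions[it]] } ?: break
        // Every sub-reader positioned on that term contributes to it and advances.
        val owners = subTerms.indices.filter {
            positions[it] < subTerms[it].size && subTerms[it][positions[it]] == smallest
        }
        owners.forEach { positions[it]++ }
        merged += smallest to owners
    }
    return merged
}
```

Because each input is sorted, the output is sorted as well, and a term that occurs in several sub-readers appears exactly once with all of its owners.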

NoDeletionPolicy

class NoDeletionPolicy : IndexDeletionPolicy

An IndexDeletionPolicy which keeps all index commits around, never deleting them. This class is a singleton and can be accessed by referencing .INSTANCE.

NoMergePolicy

class NoMergePolicy : MergePolicy

A MergePolicy which never returns merges to execute. Use it if you want to prevent segment merges.

NoMergeScheduler

class NoMergeScheduler : MergeScheduler

A MergeScheduler which never executes any merges. It is also a singleton and can be accessed through NoMergeScheduler.INSTANCE. Use it if you want to prevent an IndexWriter from ever executing merges, regardless of the MergePolicy used. Note that you can achieve the same thing by using NoMergePolicy, however with NoMergeScheduler you also ensure that no unnecessary code of any MergeScheduler implementation is ever executed. Hence it is recommended to use both if you want to disable merges from ever happening.

NumericDocValues

abstract class NumericDocValues : DocValuesIterator

A per-document numeric value.

OneMergeWrappingMergePolicy

open class OneMergeWrappingMergePolicy(in: MergePolicy, wrapOneMerge: (MergePolicy.OneMerge) -> MergePolicy.OneMerge) : FilterMergePolicy

A wrapping merge policy that wraps the MergePolicy.OneMerge objects returned by the wrapped merge policy.

OrdinalMap

class OrdinalMap : Accountable

Maps per-segment ordinals to/from global ordinal space, using a compact packed-ints representation.
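
As a rough illustration of the per-segment to global ordinal mapping, the sketch below builds a global term dictionary and one lookup array per segment. buildGlobalOrds is a hypothetical function, terms are plain Strings, and plain IntArrays are used in place of the compact packed-ints representation mentioned above.

```kotlin
// Hypothetical sketch: each segment has its own sorted term dictionary, where a
// term's position is its per-segment ordinal. The result is the global sorted
// dictionary plus, per segment, an array mapping segment ordinal -> global ordinal.
fun buildGlobalOrds(segmentTerms: List<List<String>>): Pair<List<String>, List<IntArray>> {
    val globalTerms = segmentTerms.flatten().distinct().sorted()
    val globalOrdOf = globalTerms.withIndex().associate { (ord, term) -> term to ord }
    val segmentToGlobal = segmentTerms.map { terms ->
        IntArray(terms.size) { segmentOrd -> globalOrdOf.getValue(terms[segmentOrd]) }
    }
    return globalTerms to segmentToGlobal
}
```

With segments ["a", "c"] and ["b", "c"], the term "c" has per-segment ordinal 1 in both segments and global ordinal 2.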

OrdTermState

open class OrdTermState : TermState

An ordinal-based TermState.

ParallelCompositeReader

class ParallelCompositeReader(closeSubReaders: Boolean, readers: Array<CompositeReader>, storedFieldReaders: Array<CompositeReader>) : BaseCompositeReader<LeafReader>

A CompositeReader which reads multiple, parallel indexes. Each index added must have the same number of documents, and exactly the same number of leaves (with equal maxDoc), but typically each contains different fields. Deletions are taken from the first reader. Each document contains the union of the fields of all documents with the same document number. When searching, matches for a query term are from the first index added that has the field.

ParallelLeafReader

open class ParallelLeafReader(closeSubReaders: Boolean, readers: Array<LeafReader>, storedFieldsReaders: Array<LeafReader>) : LeafReader

A LeafReader which reads multiple, parallel indexes. Each index added must have the same number of documents, but typically each contains different fields. Deletions are taken from the first reader. Each document contains the union of the fields of all documents with the same document number. When searching, matches for a query term are from the first index added that has the field.

ParallelPostingsArray

open class ParallelPostingsArray(val size: Int)

PendingDeletes

open class PendingDeletes(val info: SegmentCommitInfo, initialLiveDocs: Bits? = null, liveDocsInitialized: Boolean = !info.hasDeletions())

This class handles accounting and applying pending deletes for live segment readers.

PersistentSnapshotDeletionPolicy

class PersistentSnapshotDeletionPolicy : SnapshotDeletionPolicy

A SnapshotDeletionPolicy which adds a persistence layer so that snapshots can be maintained across the life of an application. The snapshots are persisted in a Directory and are committed as soon as snapshot or release is called.

PointValues

abstract class PointValues

Access to indexed numeric values.

PostingsEnum

abstract class PostingsEnum : DocIdSetIterator

Iterates through the postings. NOTE: you must first call .nextDoc before using any of the per-doc methods.

PrefixCodedTerms

class PrefixCodedTerms : Accountable

Prefix codes term instances (prefixes are shared). This is expected to be faster to build than an FST and might also be more compact if there are no common suffixes.
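
Prefix sharing can be sketched in a few lines. The encoding below is an assumption for illustration, not the format PrefixCodedTerms actually writes: each term in a sorted list becomes a pair of (length of the prefix shared with the previous term, remaining suffix). The names prefixEncode and prefixDecode are hypothetical.

```kotlin
// Hypothetical encoding: each term is stored as the number of leading characters
// it shares with the previous term, followed by the remaining suffix.
fun prefixEncode(sortedTerms: List<String>): List<Pair<Int, String>> {
    var previous = ""
    return sortedTerms.map { term ->
        val shared = term.commonPrefixWith(previous).length
        previous = term
        shared to term.substring(shared)
    }
}

// Reverses prefixEncode by rebuilding each term from the previous decoded term.
fun prefixDecode(encoded: List<Pair<Int, String>>): List<String> {
    var previous = ""
    return encoded.map { (shared, suffix) ->
        (previous.substring(0, shared) + suffix).also { previous = it }
    }
}
```

For the sorted terms "index", "indexer", "input", the second term stores only the suffix "er" and the third only "put".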

QueryTimeout

fun interface QueryTimeout

Query timeout abstraction that controls whether a query should continue or be stopped. Can be set to the searcher through org.gnit.lucenekmp.search.IndexSearcher.setTimeout, in which case bulk scoring will be time-bound. Can also be used in combination with ExitableDirectoryReader.
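
Since QueryTimeout is declared as a fun interface, an implementation can be written as a lambda. The sketch below defines a local stand-in so that it compiles on its own; the shouldExit name follows the QueryTimeout.shouldExit mention elsewhere in this listing, but its exact signature is an assumption, and afterChecks and visitUntilTimeout are hypothetical helpers.

```kotlin
// Local stand-in so the sketch is self-contained; the real interface may
// declare shouldExit differently.
fun interface QueryTimeout {
    fun shouldExit(): Boolean
}

// Hypothetical helper: a timeout that fires after a fixed number of checks,
// which keeps the sketch deterministic because no clock is involved.
fun afterChecks(limit: Int): QueryTimeout {
    var checks = 0
    return QueryTimeout { ++checks > limit }
}

// Hypothetical loop that visits items until the timeout asks it to stop.
fun visitUntilTimeout(items: List<String>, timeout: QueryTimeout): List<String> {
    val visited = mutableListOf<String>()
    for (item in items) {
        if (timeout.shouldExit()) break
        visited += item
    }
    return visited
}
```

A wall-clock variant would compare the current time against a deadline inside the lambda; the counter is used here only so the behavior is reproducible.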

QueryTimeoutImpl

class QueryTimeoutImpl(timeAllowed: Long) : QueryTimeout

An implementation of QueryTimeout that can be used by the ExitableDirectoryReader class to time out and exit out when a query takes a long time to rewrite.

ReaderManager

class ReaderManager : ReferenceManager<DirectoryReader>

Utility class to safely share DirectoryReader instances across multiple threads, while periodically reopening. This class ensures each reader is closed only once all threads have finished using it.

ReadersAndUpdates

class ReadersAndUpdates(indexCreatedVersionMajor: Int, val info: SegmentCommitInfo, pendingDeletes: PendingDeletes)

ReaderSlice

data class ReaderSlice(val start: Int, val length: Int, val readerIndex: Int)

Subreader slice from a parent composite reader.
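
A minimal sketch of how such slices could be laid out and queried follows. The data class is a local copy of the signature listed above; the interpretation of start as the first composite-level document ID of the slice and length as its document count is an assumption, and buildSlices and sliceFor are hypothetical helpers.

```kotlin
// Local copy of the listed signature so the sketch compiles on its own.
data class ReaderSlice(val start: Int, val length: Int, val readerIndex: Int)

// Hypothetical helper: builds contiguous slices from per-sub-reader document counts.
fun buildSlices(docCounts: List<Int>): List<ReaderSlice> {
    var start = 0
    return docCounts.mapIndexed { index, count ->
        ReaderSlice(start, count, index).also { start += count }
    }
}

// Hypothetical helper: finds the slice containing a composite-level document ID.
// Throws NoSuchElementException if the ID is outside every slice.
fun sliceFor(slices: List<ReaderSlice>, docId: Int): ReaderSlice =
    slices.first { docId >= it.start && docId < it.start + it.length }
```

Subtracting the slice's start from the composite-level ID then gives the document ID local to that sub-reader.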

ReaderUtil

object ReaderUtil

Common util methods for dealing with IndexReaders and IndexReaderContexts.

SegmentCommitInfo

class SegmentCommitInfo(val info: SegmentInfo, delCount: Int, softDelCount: Int, delGen: Long, fieldInfosGen: Long, docValuesGen: Long, id: ByteArray?)

Embeds a read-only SegmentInfo and adds per-commit fields.

SegmentCoreReaders

class SegmentCoreReaders(dir: Directory, si: SegmentCommitInfo, context: IOContext)

Holds core readers that are shared (unchanged) when SegmentReader is cloned or reopened.

SegmentDocValues

class SegmentDocValues

Manages the DocValuesProducer held by SegmentReader and keeps track of their reference counting.

SegmentInfo

class SegmentInfo(dir: Directory, version: Version, minVersion: Version?, name: String, maxDoc: Int, isCompoundFile: Boolean, hasBlocks: Boolean, codec: Codec?, diagnostics: MutableMap<String, String>, id: ByteArray, attributes: MutableMap<String, String>, indexSort: Sort?)

Information about a segment such as its name, directory, and files related to the segment.

SegmentInfos

class SegmentInfos(indexCreatedVersionMajor: Int) : Cloneable<SegmentInfos> , Iterable<SegmentCommitInfo>

A collection of segmentInfo objects with methods for operating on those segments in relation to the file system.

SegmentReader

class SegmentReader : CodecReader

IndexReader implementation over a single segment.

SegmentReadState

class SegmentReadState

Holder class for common parameters used during read.

SegmentWriteState

class SegmentWriteState

Holder class for common parameters used during write.

SerialMergeScheduler

open class SerialMergeScheduler : MergeScheduler

A MergeScheduler that simply does each merge sequentially, using the current thread.

SimpleMergedSegmentWarmer

class SimpleMergedSegmentWarmer(infoStream: InfoStream) : IndexWriter.IndexReaderWarmer

A very simple merged segment warmer that just ensures data structures are initialized.

SingleTermsEnum

class SingleTermsEnum(tenum: TermsEnum, termText: BytesRef?) : FilteredTermsEnum

Subclass of FilteredTermsEnum for enumerating a single term.

SlowCodecReaderWrapper

object SlowCodecReaderWrapper

Wraps arbitrary readers for merging. Note that this can cause slow and memory-intensive merges. Consider using FilterCodecReader instead.

SlowImpactsEnum

class SlowImpactsEnum(delegate: PostingsEnum) : ImpactsEnum

ImpactsEnum that doesn't index impacts but implements the API in a legal way. This is typically used for short postings that do not need skipping.

SnapshotDeletionPolicy

open class SnapshotDeletionPolicy(primary: IndexDeletionPolicy) : IndexDeletionPolicy

An IndexDeletionPolicy that wraps any other IndexDeletionPolicy and adds the ability to hold and later release snapshots of an index. While a snapshot is held, the IndexWriter will not remove any files associated with it even if the index is otherwise being actively, arbitrarily changed. Because we wrap another arbitrary IndexDeletionPolicy, this gives you the freedom to continue using whatever IndexDeletionPolicy you would normally want to use with your index.

SoftDeletesDirectoryReaderWrapper

class SoftDeletesDirectoryReaderWrapper : FilterDirectoryReader

This reader filters out documents that have a doc-values value in the given field and treats these documents as soft-deleted. Hard deleted documents will also be filtered out in the live docs of this reader.

SoftDeletesRetentionMergePolicy

class SoftDeletesRetentionMergePolicy(field: String, retentionQuerySupplier: () -> Query, in: MergePolicy) : OneMergeWrappingMergePolicy

This MergePolicy allows soft-deleted documents to be carried over across merges. The policy wraps the merge reader and marks documents as "live" that have a value in the soft delete field and match the provided query. This allows, for instance, documents to be kept alive based on time or any other constraint in the index. The main purpose of this merge policy is to implement retention policies that control when document modifications vanish from the index. Using this merge policy allows control over when soft deletes are claimed by merges.
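
The retention rule can be sketched with plain data instead of readers and queries. Everything in the block is hypothetical: Doc is an illustrative model, ageDays is an invented attribute, and the retention predicate stands in for the query supplied to the policy.

```kotlin
// Hypothetical document model: softDeleted marks a value in the soft-deletes
// field, ageDays is an illustrative attribute inspected by the retention rule.
data class Doc(val id: Int, val softDeleted: Boolean, val ageDays: Int)

// Returns the IDs a merge would carry over: documents that are not soft-deleted,
// plus soft-deleted documents that the retention predicate still matches.
fun carriedOver(docs: List<Doc>, retention: (Doc) -> Boolean): List<Int> =
    docs.filter { !it.softDeleted || retention(it) }.map { it.id }
```

With a predicate such as { it.ageDays <= 7 }, a soft-deleted document that is three days old survives the merge, while one that is forty days old is dropped.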

SortedDocValues

abstract class SortedDocValues : DocValuesIterator

A per-document byte[] with presorted values. This is fundamentally an iterator over the int ord values per document, with random access APIs to resolve an int ord to BytesRef.

SortedNumericDocValues

abstract class SortedNumericDocValues : DocValuesIterator

A list of per-document numeric values, sorted according to Long.compare.
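
A minimal sketch of this access pattern, assuming an in-memory map as the backing store. The class name is hypothetical; the method names mirror the iteration style of the doc-values API, but the signatures here are simplified.

```kotlin
// Toy model: every document maps to a list of longs, kept in ascending order.
// Duplicate values are retained.
class ToySortedNumericDocValues(valuesByDoc: Map<Int, List<Long>>) {
    private val sortedByDoc: Map<Int, LongArray> =
        valuesByDoc.mapValues { (_, values) -> values.sorted().toLongArray() }

    private var current: LongArray = LongArray(0)
    private var upto = 0

    // Positions on docId; returns false if the document has no values.
    fun advanceExact(docId: Int): Boolean {
        current = sortedByDoc[docId] ?: LongArray(0)
        upto = 0
        return current.isNotEmpty()
    }

    // Number of values for the current document.
    fun docValueCount(): Int = current.size

    // Next value in ascending order; call at most docValueCount() times per document.
    fun nextValue(): Long = current[upto++]
}
```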

SortedSetDocValues

abstract class SortedSetDocValues : DocValuesIterator

A multi-valued version of SortedDocValues.

Sorter

class Sorter

Sorts documents of a given index by returning a permutation on the document IDs.
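
To make "a permutation on the document IDs" concrete, here is a small sketch that sorts documents by a single numeric key and exposes the mapping in both directions. `ToyDocMap` and `sortDocs` are hypothetical names, and the single `LongArray` of keys is an assumption standing in for a real sort specification.

```kotlin
// Toy permutation: newToOld[newId] is the original ID of the document that ends up
// at position newId; oldToNew is the inverse mapping.
class ToyDocMap(val newToOld: IntArray) {
    val oldToNew: IntArray = IntArray(newToOld.size).also { inverse ->
        for (newId in newToOld.indices) inverse[newToOld[newId]] = newId
    }
}

// keys[docId] is the sort key of each document. sortedBy is stable, so documents
// with equal keys keep their original relative order.
fun sortDocs(keys: LongArray): ToyDocMap =
    ToyDocMap(keys.indices.sortedBy { keys[it] }.toIntArray())
```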

SortFieldProvider

abstract class SortFieldProvider : NamedSPILoader.NamedSPI

Reads/writes a named SortField from a segment info file, used to record index sorts.

SortingCodecReader

class SortingCodecReader : FilterCodecReader

A CodecReader which supports sorting documents by a given Sort. This can be used to re-sort an index after it has been created, by wrapping all readers of the index with this reader and adding them to a fresh IndexWriter via IndexWriter.addIndexes. NOTE: This reader should only be used for merging. Pulling fields from this reader might be very costly and memory intensive.

StandardDirectoryReader

class StandardDirectoryReader : DirectoryReader

Default implementation of DirectoryReader.

StoredFieldDataInput

class StoredFieldDataInput(val in: DataInput, val length: Int)

A fixed size DataInput which includes the length of the input. For use as a StoredField.

StoredFields

abstract class StoredFields

API for reading stored fields.

StoredFieldsConsumer

open class StoredFieldsConsumer(val codec: Codec, val directory: Directory, val info: SegmentInfo)

StoredFieldVisitor

abstract class StoredFieldVisitor

Expert: provides a low-level means of accessing the stored field values in an index. See StoredFields.document.

Term

class Term : Comparable<Term> , Accountable

A Term represents a word from text. This is the unit of search. It is composed of two elements, the text of the word, as a string, and the name of the field that the text occurred in.
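
The two-element structure and the Comparable contract can be sketched like this. `ToyTerm` is a hypothetical stand-in, and the ordering shown (first by field, then by text) is stated here as an assumption carried over from upstream Lucene; the real class compares term bytes rather than strings.

```kotlin
// Toy term: a (field, text) pair ordered first by field name, then by text.
data class ToyTerm(val field: String, val text: String) : Comparable<ToyTerm> {
    override fun compareTo(other: ToyTerm): Int =
        compareValuesBy(this, other, { it.field }, { it.text })
}
```

Sorting a mixed list therefore groups terms by field, with the texts of each field in ascending order.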

Terms

abstract class Terms

Access to the terms in a specific field. See Fields.

TermsEnum

abstract class TermsEnum : BytesRefIterator

Iterator to seek (seekCeil, seekExact) or step through (next) terms to obtain frequency information (docFreq) or a PostingsEnum for the current term (postings).
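
The seek-versus-step distinction can be illustrated with a toy enum over an in-memory sorted term list. Everything here (`ToyTermsEnum`, `ToySeekStatus`, `String` terms) is an assumption for illustration; in particular, this sketch leaves the enum positioned on the ceiling term after a failed `seekExact`, which should not be relied on with the real class.

```kotlin
enum class ToySeekStatus { FOUND, NOT_FOUND, END }

// Toy terms enum over a map of term -> document frequency.
class ToyTermsEnum(docFreqByTerm: Map<String, Int>) {
    private val terms: List<String> = docFreqByTerm.keys.sorted()
    private val freqs: List<Int> = terms.map { docFreqByTerm.getValue(it) }
    private var pos = -1 // -1 means "not positioned yet"

    // Steps to the next term, or returns null once the terms are exhausted.
    fun next(): String? {
        pos = minOf(pos + 1, terms.size)
        return term()
    }

    // Positions on the smallest term >= target.
    fun seekCeil(target: String): ToySeekStatus {
        val idx = terms.binarySearch(target)
        pos = if (idx >= 0) idx else -(idx + 1) // insertion point when absent
        return when {
            idx >= 0 -> ToySeekStatus.FOUND
            pos < terms.size -> ToySeekStatus.NOT_FOUND
            else -> ToySeekStatus.END
        }
    }

    fun seekExact(target: String): Boolean = seekCeil(target) == ToySeekStatus.FOUND

    // Current term, or null when unpositioned or exhausted.
    fun term(): String? = terms.getOrNull(pos)

    // Document frequency of the current term; throws if there is no current term.
    fun docFreq(): Int = freqs[pos]
}
```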

TermsEnumIndex

open class TermsEnumIndex(var termsEnum: TermsEnum?, val subIndex: Int)

Wrapper around a TermsEnum and an integer that identifies it. All operations that move the current position of the TermsEnum must be performed via this wrapper class, not directly on the wrapped TermsEnum.

TermsHash

abstract class TermsHash(intBlockAllocator: IntBlockPool.Allocator, byteBlockAllocator: ByteBlockPool.Allocator, val bytesUsed: Counter, val nextTermsHash: TermsHash?)

This class is passed each token produced by the analyzer on each field during indexing, and it stores these tokens in a hash table, and allocates separate byte streams per token. Consumers of this class, eg FreqProxTermsWriter and TermVectorsConsumer, write their own byte streams under each term.

TermsHashPerField

abstract class TermsHashPerField(streamCount: Int, intPool: IntBlockPool, val bytePool: ByteBlockPool, termBytePool: ByteBlockPool, bytesUsed: Counter, val nextPerField: TermsHashPerField?, val fieldName: String, indexOptions: IndexOptions) : Comparable<TermsHashPerField>

This class stores streams of information per term without knowing the size of the stream ahead of time. Each stream typically encodes one level of information, such as term frequency per document or term proximity. Internally, this class allocates a linked list of slices for each term that can be read by a ByteSliceReader. Terms are first deduplicated in a BytesRefHash; once this is done, internal data structures point to the current offset of each stream that can be written to.

TermState

abstract class TermState : Cloneable<TermState>

Encapsulates all required internal state to position the associated TermsEnum without re-seeking.

TermStates

class TermStates

Maintains an IndexReader view over IndexReader instances containing a single term. TermStates does not track whether the given TermState objects are valid, nor whether the TermState instances refer to the same terms in the associated readers.

TermVectors

abstract class TermVectors

API for reading term vectors.

TermVectorsConsumer

open class TermVectorsConsumer(intBlockAllocator: IntBlockPool.Allocator, byteBlockAllocator: ByteBlockPool.Allocator, directory: Directory, info: SegmentInfo, codec: Codec) : TermsHash

TermVectorsConsumerPerField

class TermVectorsConsumerPerField(invertState: FieldInvertState, termsHash: TermVectorsConsumer, fieldInfo: FieldInfo) : TermsHashPerField

TieredMergePolicy

open class TieredMergePolicy : MergePolicy

Merges segments of approximately equal size, subject to an allowed number of segments per tier. This is similar to LogByteSizeMergePolicy, except this merge policy is able to merge non-adjacent segments. This merge policy also does not over-merge (i.e. cascade merges).

TrackingTmpOutputDirectoryWrapper

class TrackingTmpOutputDirectoryWrapper(in: Directory) : FilterDirectory

TwoPhaseCommit

interface TwoPhaseCommit

An interface for implementations that support 2-phase commit. You can use TwoPhaseCommitTool to execute a 2-phase commit algorithm over several TwoPhaseCommits.

TwoPhaseCommitTool

object TwoPhaseCommitTool

A utility for executing 2-phase commit on several objects.
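
The general shape of a 2-phase commit over several objects can be sketched as follows. This is an illustration under stated assumptions rather than the tool's actual implementation: `ToyTwoPhaseCommit`, `executeAll`, and `RecordingParticipant` are hypothetical names, and the error handling is reduced to a best-effort rollback of every participant.

```kotlin
// Hypothetical participant interface with the three operations a 2-phase commit needs.
interface ToyTwoPhaseCommit {
    fun prepareCommit()
    fun commit()
    fun rollback()
}

// Phase 1 prepares every participant; phase 2 commits only if all prepared.
// On any failure, every participant is rolled back (best effort) and the
// original exception is rethrown.
fun executeAll(participants: List<ToyTwoPhaseCommit>) {
    try {
        participants.forEach { it.prepareCommit() }
        participants.forEach { it.commit() }
    } catch (e: Exception) {
        participants.forEach { p -> runCatching { p.rollback() } }
        throw e
    }
}

// Small participant that records what happened to it, for demonstration only.
class RecordingParticipant(
    private val name: String,
    private val log: MutableList<String>,
    private val failOnPrepare: Boolean = false,
) : ToyTwoPhaseCommit {
    override fun prepareCommit() {
        if (failOnPrepare) throw IllegalStateException("$name cannot prepare")
        log += "$name:prepare"
    }
    override fun commit() { log += "$name:commit" }
    override fun rollback() { log += "$name:rollback" }
}
```

The point of the two phases is that no participant commits until every participant has signalled that it can.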

UpgradeIndexMergePolicy

class UpgradeIndexMergePolicy(in: MergePolicy) : FilterMergePolicy

This MergePolicy is used for upgrading all existing segments of an index when calling IndexWriter.forceMerge. All other methods delegate to the base MergePolicy given to the constructor. This allows for an as-cheap-as-possible upgrade of an older index by only upgrading segments that were created by previous Lucene versions. forceMerge no longer really merges; it is just used to "forceMerge" older segment versions away.

VectorEncoding

enum VectorEncoding : Enum<VectorEncoding>

The numeric datatype of the vector values.

VectorSimilarityFunction

Vector similarity function; used in search to return top K most similar vectors to a target vector. This is a label describing the method used during indexing and searching of the vectors in order to determine the nearest neighbors.
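
A brute-force sketch of "top K most similar vectors" under two common similarity choices. The enum `ToySimilarity` and the functions `dot` and `topK` are hypothetical, the scores are raw values without any normalisation the library may apply, and a real index would use an approximate nearest-neighbour structure instead of scoring every vector.

```kotlin
import kotlin.math.sqrt

// Hypothetical similarity labels, each carrying its scoring function.
// COSINE yields NaN for zero-length vectors in this sketch.
enum class ToySimilarity(val score: (FloatArray, FloatArray) -> Float) {
    DOT_PRODUCT({ a, b -> dot(a, b) }),
    COSINE({ a, b -> dot(a, b) / (sqrt(dot(a, a)) * sqrt(dot(b, b))) })
}

fun dot(a: FloatArray, b: FloatArray): Float {
    require(a.size == b.size) { "vector dimensions differ" }
    var sum = 0f
    for (i in a.indices) sum += a[i] * b[i]
    return sum
}

// Returns the indices of the k vectors most similar to target, best first.
fun topK(target: FloatArray, vectors: List<FloatArray>, k: Int, similarity: ToySimilarity): List<Int> =
    vectors.indices.sortedByDescending { similarity.score(target, vectors[it]) }.take(k)
```

The choice of function changes the ranking: a long vector pointing in a slightly different direction can win under dot product yet lose under cosine, which is why the same function has to be used at indexing and search time.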

VectorValuesConsumer

class VectorValuesConsumer(codec: Codec, directory: Directory, segmentInfo: SegmentInfo, infoStream: InfoStream)

Streams vector values for indexing to the given codec's vectors writer. The codec's vectors writer is responsible for buffering and processing vectors.