Package-level declarations
Types
Base class for testing TokenStream factories.
Base class for all Lucene unit tests that use TokenStreams.
TokenStream from a canned list of Tokens.
Throws IOException from random TokenStream methods.
Simplified Kotlin port of Lucene's LookaheadTokenFilter.
Analyzer for testing purposes.
Analyzer for testing that encodes terms as UTF-16 bytes.
A CharFilter whose purpose is to send offsets out of bounds if the analyzer doesn't use correctOffset or does incorrect offset math.
TokenFilter that adds random fixed-length payloads.
Randomly inserts overlapping tokens with variable position length.
Randomly injects holes (similar to what a stop filter would do).
A lowercasing org.gnit.lucenekmp.analysis.TokenFilter.
Wraps a whitespace tokenizer with a filter that sets the position increment to 1 for the first token and for odd tokens, and to 0 for all others, encoding the position as pos: XXX in the payload.
Wraps a Reader; it can throw random or fixed exceptions and spoon-feed the chars it reads.
Adds a synonym of "dog" for "dogs", and a synonym of "cavy" for "guinea pig".
Adds a synonym of "dog" for "dogs", and a synonym of "cavy" for "guinea pig".
A token filter for testing that removes terms accepted by a DFA.
Tokenizer for testing.
Extension of CharTermAttributeImpl that encodes the term text as UTF-16 bytes instead of as UTF-8 bytes.
TokenFilter that adds random variable-length payloads.
A Token is an occurrence of a term from the text of a field. It consists of the term's text, start and end offsets, and optionally flags and payload.
Consumes a TokenStream and outputs its token graph as a dot (Graphviz) string.
Simple example of a filter that exercises LookaheadTokenFilter.
A TokenFilter that checks the consistency of the tokens (e.g., that offsets are consistent with one another).