Wraps a whitespace tokenizer with a filter that sets the position increment of the first token and of odd tokens to 1 and of all others to 0, encoding the position as pos: XXX in the payload.
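
The increment rule described above can be sketched without any Lucene dependency. This is only an illustration: `SketchToken` and `tagPositions` are hypothetical names, tokens are counted from index 0, and the payload is assumed to hold the zero-based position obtained by summing the increments, which may differ from the numbering the real class uses.

```kotlin
// Hypothetical stand-in for a token carrying a position increment and a payload.
data class SketchToken(val term: String, val positionIncrement: Int, val payload: String)

// Splits on whitespace, then gives the first token and every odd-indexed token
// an increment of 1 and all other tokens an increment of 0.
fun tagPositions(text: String): List<SketchToken> {
    val terms = text.trim().split(Regex("\\s+")).filter { it.isNotEmpty() }
    var position = -1 // assumption: positions are zero-based sums of the increments
    return terms.mapIndexed { index, term ->
        val increment = if (index == 0 || index % 2 == 1) 1 else 0
        position += increment
        SketchToken(term, increment, "pos: $position")
    }
}
```

Under these assumptions, `tagPositions("a b c d e")` yields the increments 1, 1, 0, 1, 0, so "b" and "c" share a position, as do "d" and "e".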

MockReaderWrapper

class MockReaderWrapper(random: Random, in: Reader) : Reader

Wraps a Reader and can throw random or fixed exceptions, and can spoon-feed characters on read.
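
A minimal sketch of the spoon-feeding idea, assuming it means returning fewer characters per read than the caller asked for. `SpoonFeedReader` is a hypothetical class, the cap of three characters per call is arbitrary, `kotlin.random.Random` is assumed for the random source, and exception injection is left out.

```kotlin
import java.io.Reader
import java.io.StringReader
import kotlin.random.Random

// Hypothetical reader that hands back at most three characters per read call,
// forcing consumers to cope with short reads.
class SpoonFeedReader(private val random: Random, private val delegate: Reader) : Reader() {
    override fun read(cbuf: CharArray, off: Int, len: Int): Int {
        if (len == 0) return 0
        // Pick a chunk size between 1 and min(len, 3).
        val chunk = 1 + random.nextInt(minOf(len, 3))
        return delegate.read(cbuf, off, chunk)
    }

    override fun close() = delegate.close()
}
```

A consumer that loops until end of stream still sees the full text, for example `SpoonFeedReader(Random(42), StringReader("hello world")).readText()`.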

MockSynonymAnalyzer

class MockSynonymAnalyzer : Analyzer

Adds the synonym "dog" for "dogs" and the synonym "cavy" for "guinea pig".

MockSynonymFilter

class MockSynonymFilter(input: TokenStream) : TokenFilter

Adds the synonym "dog" for "dogs" and the synonym "cavy" for "guinea pig".
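
The two synonym rules can be illustrated on a plain list of terms. This sketch does not use the Lucene API: `SynToken` and `addSynonyms` are hypothetical, and it assumes each synonym is stacked on the position of the original term (position increment 0), with "cavy" spanning the two positions of "guinea pig".

```kotlin
// Hypothetical token with the two attributes that matter for stacked synonyms.
data class SynToken(val term: String, val positionIncrement: Int = 1, val positionLength: Int = 1)

fun addSynonyms(terms: List<String>): List<SynToken> {
    val out = mutableListOf<SynToken>()
    for ((i, term) in terms.withIndex()) {
        out += SynToken(term)
        if (term == "dogs") {
            // Single-word synonym at the same position as "dogs".
            out += SynToken("dog", positionIncrement = 0)
        } else if (term == "guinea" && terms.getOrNull(i + 1) == "pig") {
            // Assumption: the multi-word synonym starts at "guinea" and covers two positions.
            out += SynToken("cavy", positionIncrement = 0, positionLength = 2)
        }
    }
    return out
}
```

For the input `listOf("dogs", "chase", "guinea", "pig")` the sketch emits the terms dogs, dog, chase, guinea, cavy, pig in that order.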

MockTokenFilter

class MockTokenFilter(input: TokenStream, filter: CharacterRunAutomaton) : TokenFilter

A token filter for testing that removes terms accepted by a DFA.
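
As a rough illustration of filtering by automaton, the sketch below builds a small DFA from a word set and drops every term the DFA accepts. `WordSetDfa`, `FilteredToken`, and `removeAccepted` are hypothetical and are not the `CharacterRunAutomaton` API; recording a removed term as a larger position increment on the next kept token is also an assumption.

```kotlin
// Hypothetical trie-shaped DFA: state 0 is the start state, one transition map per state.
class WordSetDfa(words: Set<String>) {
    private val transitions = mutableListOf<MutableMap<Char, Int>>(mutableMapOf())
    private val accepting = mutableSetOf<Int>()

    init {
        for (word in words) {
            var state = 0
            for (ch in word) {
                // Reuse an existing transition or allocate a new state.
                state = transitions[state].getOrPut(ch) {
                    transitions.add(mutableMapOf())
                    transitions.size - 1
                }
            }
            accepting.add(state)
        }
    }

    fun accepts(term: String): Boolean {
        var state = 0
        for (ch in term) {
            state = transitions[state][ch] ?: return false
        }
        return state in accepting
    }
}

data class FilteredToken(val term: String, val positionIncrement: Int)

// Drops accepted terms; each dropped term widens the gap before the next kept token.
fun removeAccepted(terms: List<String>, dfa: WordSetDfa): List<FilteredToken> {
    val out = mutableListOf<FilteredToken>()
    var skipped = 0
    for (term in terms) {
        if (dfa.accepts(term)) {
            skipped++
        } else {
            out += FilteredToken(term, 1 + skipped)
            skipped = 0
        }
    }
    return out
}
```

With a DFA built from `setOf("the", "a")`, the terms "the quick a fox" reduce to "quick" and "fox", each with an increment of 2 under the assumption above.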

MockTokenizer

class MockTokenizer : Tokenizer

Tokenizer for testing.

MockUTF16TermAttributeImpl

class MockUTF16TermAttributeImpl : CharTermAttributeImpl

Extension of CharTermAttributeImpl that encodes the term text as UTF-16 bytes instead of as UTF-8 bytes.
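
The difference between the two encodings is easy to see with the standard library alone. The helper names are hypothetical, and little-endian UTF-16 without a byte order mark is an assumption about the byte order.

```kotlin
// Encodes a term the usual way, as UTF-8 bytes.
fun utf8Bytes(term: String): ByteArray = term.toByteArray(Charsets.UTF_8)

// Encodes the same term as UTF-16 bytes (assumed little-endian, no byte order mark).
fun utf16Bytes(term: String): ByteArray = term.toByteArray(Charsets.UTF_16LE)
```

For the term "h\u00e9llo", the UTF-8 form takes 6 bytes and the UTF-16 form takes 10, so code that assumes UTF-8 term bytes sees both different lengths and a different sort order.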

MockVariableLengthPayloadFilter

class MockVariableLengthPayloadFilter(random: Random, in: TokenStream) : TokenFilter

TokenFilter that adds random variable-length payloads.

Token

class Token : PackedTokenAttributeImpl, FlagsAttribute, PayloadAttribute

A Token is an occurrence of a term from the text of a field. It consists of the term's text, start and end offsets, and optionally flags and payload.

TokenStreamToDot

class TokenStreamToDot(inputText: String, in: TokenStream, out: PrintWriter)

Consumes a TokenStream and outputs its token graph as a dot (Graphviz) string.
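
To show what a dot rendering of a token graph might look like, the sketch below turns a list of tokens into one edge per token. `GraphToken` and `toDot` are hypothetical, and the output layout is an assumption, not the format this class actually produces.

```kotlin
// Hypothetical token: the increment picks the source node, the length picks the target node.
data class GraphToken(val term: String, val positionIncrement: Int = 1, val positionLength: Int = 1)

fun toDot(tokens: List<GraphToken>): String {
    val sb = StringBuilder("digraph tokens {\n")
    var position = -1
    for (token in tokens) {
        position += token.positionIncrement
        val target = position + token.positionLength
        // One labeled edge from the token's start position to its end position.
        sb.append("  $position -> $target [label=\"${token.term}\"];\n")
    }
    sb.append("}\n")
    return sb.toString()
}
```

A stacked synonym such as `GraphToken("cavy", 0, 2)` between "guinea" and "pig" appears as an edge from node 0 to node 2 that runs parallel to the two single-step edges.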

TrivialLookaheadFilter

class TrivialLookaheadFilter(input: TokenStream) : LookaheadTokenFilter<TrivialLookaheadFilter.TestPosition>

Simple example of a filter that exercises LookaheadTokenFilter.

ValidatingTokenFilter

class ValidatingTokenFilter(input: TokenStream, name: String) : TokenFilter

A TokenFilter that checks the consistency of the tokens (e.g., that offsets are consistent with one another).
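
The kind of consistency check meant here can be sketched over plain data. The three rules below are assumptions chosen for illustration (non-negative and ordered offsets, start offsets that never move backwards, and one start offset per position); `OffsetToken` and `validate` are hypothetical names, and the real filter may check more or different conditions.

```kotlin
// Hypothetical token carrying offsets and a position increment.
data class OffsetToken(
    val term: String,
    val startOffset: Int,
    val endOffset: Int,
    val positionIncrement: Int = 1
)

// Returns a description of every rule violation; an empty list means the stream looks consistent.
fun validate(tokens: List<OffsetToken>): List<String> {
    val problems = mutableListOf<String>()
    val startOffsetByPosition = mutableMapOf<Int, Int>()
    var position = -1
    var lastStart = 0
    for (t in tokens) {
        position += t.positionIncrement
        if (t.startOffset < 0 || t.endOffset < t.startOffset) {
            problems += "${t.term}: bad offsets ${t.startOffset}..${t.endOffset}"
        }
        if (t.startOffset < lastStart) {
            problems += "${t.term}: start offset went backwards"
        }
        // Tokens stacked on one position are expected to share a start offset.
        val known = startOffsetByPosition.getOrPut(position) { t.startOffset }
        if (known != t.startOffset) {
            problems += "${t.term}: start offset ${t.startOffset} differs from $known at position $position"
        }
        lastStart = t.startOffset
    }
    return problems
}
```

A stream such as "dogs" (0..4) with a stacked "dog" (0..4, increment 0) passes, while a stacked "dog" starting at offset 1 is reported.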