Package-level declarations

Types

Link copied to clipboard

This attribute can be used if you have the raw term bytes to be indexed. It can be used as replacement for CharTermAttribute, if binary terms should be indexed.

Link copied to clipboard

The term text of a Token.

Link copied to clipboard

This attribute can be used to pass different flags down the tokenizer chain.

Link copied to clipboard
Link copied to clipboard

This attribute can be used to mark a token as a keyword. Keyword aware TokenStreams can decide to modify a token based on the return value of .isKeyword if the token is modified. Stemming filters for instance can use this attribute to conditionally skip a term if .isKeyword returns true.

Link copied to clipboard
Link copied to clipboard

The start and end character offset of a Token.

Link copied to clipboard
Link copied to clipboard
Link copied to clipboard

The payload of a Token.

Link copied to clipboard
Link copied to clipboard

Determines the position of this token relative to the previous Token in a TokenStream, used in phrase searching.

Link copied to clipboard

Determines how many positions this token spans. Very few analyzer components actually produce this attribute, and indexing ignores it, but it's useful to express the graph structure naturally produced by decompounding, word splitting/joining, synonym filtering, etc.

Link copied to clipboard

Sets the custom term frequency of a term within one document. If this attribute is present in your analysis chain for a given field, that field must be indexed with IndexOptions.DOCS_AND_FREQS.

Link copied to clipboard

This attribute is requested by TermsHashPerField to index the contents. This attribute can be used to customize the final byte[] encoding of terms.

Link copied to clipboard

A Token's lexical type. The Default value is "word".

Link copied to clipboard

Default implementation of TypeAttribute.