ClassicAnalyzer

Filters ClassicTokenizer with ClassicFilter, LowerCaseFilter and StopFilter, using a list of English stop words.
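The order of the chain described above can be illustrated with a minimal, library-free sketch. It is a simplification under stated assumptions, not the analyzer's implementation: a plain regex split stands in for ClassicTokenizer, ClassicFilter's normalization (stripping possessive 's and the dots in acronyms) is omitted, and `sampleStopWords` and `analyzeSketch` are hypothetical names with a made-up subset of stop words.

```kotlin
// Hypothetical subset of English stop words, for illustration only.
val sampleStopWords = setOf("a", "an", "and", "the", "of", "is")

// Simplified stand-in for the chain: tokenize, lower-case, drop stop words.
// ClassicFilter's possessive and acronym handling is intentionally left out.
fun analyzeSketch(text: String, stopWords: Set<String> = sampleStopWords): List<String> =
    text.split(Regex("[^\\p{L}\\p{N}]+"))   // stand-in for ClassicTokenizer
        .filter { it.isNotEmpty() }         // discard empty pieces from the split
        .map { it.lowercase() }             // LowerCaseFilter step
        .filter { it !in stopWords }        // StopFilter step

fun main() {
    // prints [quick, brown, fox, lazy, dog]
    println(analyzeSketch("The Quick Brown Fox and the Lazy Dog"))
}
```

Lower-casing runs before stop-word removal, which is why "The" is dropped even though the stop list holds only lower-case entries.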

ClassicAnalyzer was named StandardAnalyzer in Lucene versions prior to 3.1. As of 3.1, StandardAnalyzer implements Unicode text segmentation, as specified by UAX#29.

Since

3.1

Constructors

constructor(stopWords: CharArraySet)

Builds an analyzer with the given stop words.

constructor()

Builds an analyzer with the default stop words (STOP_WORDS_SET).

constructor(stopwords: Reader)

Builds an analyzer with the stop words from the given reader.

Types

object Companion

Properties


Functions

open override fun close()
open fun getOffsetGap(fieldName: String?): Int
open fun getPositionIncrementGap(fieldName: String?): Int
fun normalize(fieldName: String, text: String): BytesRef
fun setMaxTokenLength(length: Int)

Sets the maximum allowed token length. A token that exceeds this length is discarded. The setting only takes effect the next time either tokenStream overload is called.
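The length rule can be shown with a small standalone sketch. It only models the documented behavior (over-long tokens are dropped whole, not truncated); `dropLongTokens` is a hypothetical helper, not part of the library, and the real check happens inside the tokenizer.

```kotlin
// Hypothetical helper modeling the documented rule: a token longer than
// maxTokenLength is discarded entirely; shorter or equal tokens are kept.
fun dropLongTokens(tokens: List<String>, maxTokenLength: Int): List<String> =
    tokens.filter { it.length <= maxTokenLength }

fun main() {
    val tokens = listOf("index", "supercalifragilistic", "search")
    // prints [index, search]
    println(dropLongTokens(tokens, 10))
}
```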

Link copied to clipboard
fun tokenStream(fieldName: String, text: String): TokenStream
fun tokenStream(fieldName: String, reader: Reader): TokenStream