common/org.gnit.lucenekmp.analysis.classic/ClassicAnalyzer

ClassicAnalyzer

class ClassicAnalyzer : StopwordAnalyzerBase

Filters ClassicTokenizer with ClassicFilter, LowerCaseFilter and StopFilter, using a list of English stop words.
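As a rough illustration of what this chain does to text, the following self-contained Kotlin sketch mimics the lowercasing and stop-word steps without using the library. The `sketchAnalyze` function, the crude splitting rule, and the small `sampleStopWords` set are assumptions made for this example. ClassicTokenizer's grammar, the ClassicFilter step (omitted here), and the actual default stop-word set all differ from this sketch.

```kotlin
// Hypothetical stand-in for the default English stop words; NOT the library's actual set.
val sampleStopWords: Set<String> = setOf("a", "an", "and", "the", "of", "to", "is")

// Illustrative only: splitting on non-alphanumeric characters is far simpler
// than the grammar ClassicTokenizer uses.
fun sketchAnalyze(text: String, stopWords: Set<String> = sampleStopWords): List<String> =
    text.split(Regex("[^A-Za-z0-9]+"))      // crude tokenization (assumption)
        .filter { it.isNotEmpty() }         // drop empty fragments left by split
        .map { it.lowercase() }             // corresponds to the LowerCaseFilter step
        .filter { it !in stopWords }        // corresponds to the StopFilter step

fun main() {
    // prints [quick, brown, fox, lazy, dog]
    println(sketchAnalyze("The Quick Brown Fox and the Lazy Dog"))
}
```

Lowercasing happens before stop-word removal in this sketch, which is why "The" matches the lowercase entry "the" and is dropped.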

ClassicAnalyzer was named StandardAnalyzer in Lucene versions prior to 3.1. As of 3.1, StandardAnalyzer implements Unicode text segmentation, as specified by UAX#29.

Since

3.1

Constructors

ClassicAnalyzer

constructor(stopWords: CharArraySet)

Builds an analyzer with the given stop words.

constructor()

Builds an analyzer with the default stop words (STOP_WORDS_SET).

constructor(stopwords: Reader)

Builds an analyzer with the stop words from the given reader.

Types

Companion

object Companion

Properties

reuseStrategy

val reuseStrategy: Analyzer.ReuseStrategy

stopwords

val stopwords: CharArraySet

storedValue

var storedValue: CloseableThreadLocal<Any>?

Functions

close

open override fun close()

getMaxTokenLength

fun getMaxTokenLength(): Int

getOffsetGap

open fun getOffsetGap(fieldName: String?): Int

getPositionIncrementGap

open fun getPositionIncrementGap(fieldName: String?): Int

normalize

fun normalize(fieldName: String, text: String): BytesRef

setMaxTokenLength

fun setMaxTokenLength(length: Int)

Sets the maximum allowed token length. Any token that exceeds this length is discarded. The setting takes effect only the next time either tokenStream overload is called.
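The discard rule can be pictured with a small self-contained sketch. `dropOverlongTokens` is a hypothetical helper written for this example, not part of the library. It only illustrates that a token longer than the limit is dropped entirely rather than truncated.

```kotlin
// Hypothetical helper: keeps only tokens whose length does not exceed maxTokenLength.
fun dropOverlongTokens(tokens: List<String>, maxTokenLength: Int): List<String> =
    tokens.filter { it.length <= maxTokenLength }

fun main() {
    val tokens = listOf("lucene", "supercalifragilistic", "kmp")
    // prints [lucene, kmp]; the 20-character token exceeds the limit of 8
    println(dropOverlongTokens(tokens, 8))
}
```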

tokenStream

fun tokenStream(fieldName: String, text: String): TokenStream

fun tokenStream(fieldName: String, reader: Reader): TokenStream