BulgarianAnalyzer

Analyzer for Bulgarian.

This analyzer implements light stemming as specified in "Searching Strategies for the Bulgarian Language": http://members.unine.ch/jacques.savoy/Papers/BUIR.pdf

Since

3.1

Constructors

constructor()

Builds an analyzer with the default stop words: DEFAULT_STOPWORD_FILE.

constructor(stopwords: CharArraySet)

Builds an analyzer with the given stop words.

constructor(stopwords: CharArraySet, stemExclusionSet: CharArraySet)

Builds an analyzer with the given stop words and a stem exclusion set. If a stem exclusion set is provided, this analyzer adds a SetKeywordMarkerFilter before BulgarianStemFilter, so terms in that set are not stemmed.
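
The three constructors can be sketched as follows. This is a minimal sketch, not a definitive usage: the import paths are those of Apache Lucene on the JVM and are an assumption here (this library may place the classes in a different package), as is the CharArraySet(collection, ignoreCase) constructor. The stop words and the excluded term are illustrative values only.

```kotlin
// Assumption: package names follow Apache Lucene on the JVM; adjust for this library.
import org.apache.lucene.analysis.CharArraySet
import org.apache.lucene.analysis.bg.BulgarianAnalyzer

fun main() {
    // 1. Default stop words.
    val defaultAnalyzer = BulgarianAnalyzer()

    // 2. Caller-supplied stop words (illustrative list); `true` means ignore case.
    val stopwords = CharArraySet(listOf("и", "на", "за"), true)
    val customStops = BulgarianAnalyzer(stopwords)

    // 3. Stop words plus a stem exclusion set: terms in the second set
    //    are marked as keywords and skipped by the stemmer.
    val exclusions = CharArraySet(listOf("българия"), true)
    val withExclusions = BulgarianAnalyzer(stopwords, exclusions)

    // Analyzers hold resources, so close them when done.
    listOf(defaultAnalyzer, customStops, withExclusions).forEach { it.close() }
}
```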

Types

object Companion

File containing default Bulgarian stopwords.

Properties


Functions

open override fun close()
open fun getOffsetGap(fieldName: String?): Int
open fun getPositionIncrementGap(fieldName: String?): Int
fun normalize(fieldName: String, text: String): BytesRef
fun tokenStream(fieldName: String, text: String): TokenStream
fun tokenStream(fieldName: String, reader: Reader): TokenStream
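
A typical way to consume the stream returned by tokenStream is sketched below. It follows the usual Lucene workflow (reset, incrementToken, end, close). The import paths and the addAttribute(CharTermAttribute::class.java) call are taken from Apache Lucene on the JVM and are assumptions for this library, which may expose a different package or attribute lookup. The field name "body", the helper analyze, and the sample sentence are hypothetical.

```kotlin
// Assumption: package names and attribute API follow Apache Lucene on the JVM.
import org.apache.lucene.analysis.bg.BulgarianAnalyzer
import org.apache.lucene.analysis.tokenattributes.CharTermAttribute

// Hypothetical helper: returns the analyzed terms for a piece of Bulgarian text.
fun analyze(text: String): List<String> {
    val terms = mutableListOf<String>()
    BulgarianAnalyzer().use { analyzer ->
        // "body" is an illustrative field name.
        analyzer.tokenStream("body", text).use { stream ->
            val termAttr = stream.addAttribute(CharTermAttribute::class.java)
            stream.reset()                      // required before the first incrementToken()
            while (stream.incrementToken()) {
                terms += termAttr.toString()    // copy the term; the attribute is reused per token
            }
            stream.end()                        // finalize offsets before the stream is closed
        }
    }
    return terms
}

fun main() {
    // Prints the terms that remain after stop-word removal and light stemming.
    println(analyze("Книгите са на масата"))
}
```

Wrapping both the analyzer and the stream in use blocks closes them even if analysis throws, which matches the close() function listed above.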