ArabicAnalyzer

Analyzer for Arabic.

This analyzer implements light-stemming as specified by: * Light Stemming for Arabic Information Retrieval * http://www.mtholyoke.edu/~lballest/Pubs/arab_stem05.pdf

The analysis package contains three primary components:

Constructors

Link copied to clipboard
constructor(stopwords: CharArraySet, stemExclusionSet: CharArraySet)

Builds an analyzer with the given stop word and stem exclusion set.

constructor(stopwords: CharArraySet)

Builds an analyzer with the given stop words.

constructor()

Builds an analyzer with the default stop words: DEFAULT_STOPWORD_FILE.

Types

Link copied to clipboard
object Companion

Properties

Link copied to clipboard
Link copied to clipboard
Link copied to clipboard

Functions

Link copied to clipboard
open override fun close()
Link copied to clipboard
open fun getOffsetGap(fieldName: String?): Int
Link copied to clipboard
open fun getPositionIncrementGap(fieldName: String?): Int
Link copied to clipboard
fun normalize(fieldName: String, text: String): BytesRef
Link copied to clipboard
fun tokenStream(fieldName: String, text: String): TokenStream
fun tokenStream(fieldName: String, reader: Reader): TokenStream