PersianAnalyzer
Analyzer for Persian.
This Analyzer uses PersianCharFilter, which implies tokenizing around the zero-width non-joiner in addition to whitespace. Some Persian-specific variant forms (such as Farsi yeh and keheh) are standardized. "Stemming" is accomplished via stop words.
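The behavior described above can be illustrated with a minimal plain-Kotlin sketch. It does not use PersianAnalyzer or PersianCharFilter at all. The helper names (`normalizeVariants`, `tokenize`, `analyze`) are hypothetical, and the choice of Arabic yeh and Arabic kaf as the standardized forms is an assumption of this sketch. It only shows the idea: treat the zero-width non-joiner as a token boundary, fold variant letters into one form, then drop stop words.

```kotlin
// Illustrative only: plain Kotlin, no analyzer classes involved.
const val ZWNJ = '\u200C'        // zero-width non-joiner
const val FARSI_YEH = '\u06CC'   // Farsi yeh
const val ARABIC_YEH = '\u064A'  // Arabic yeh (assumed standard form)
const val KEHEH = '\u06A9'       // keheh
const val ARABIC_KAF = '\u0643'  // Arabic kaf (assumed standard form)

/** Hypothetical helper: maps Persian variant letters to a single form. */
fun normalizeVariants(text: String): String =
    text.map { ch ->
        when (ch) {
            FARSI_YEH -> ARABIC_YEH
            KEHEH -> ARABIC_KAF
            else -> ch
        }
    }.joinToString("")

/** Hypothetical helper: splits on whitespace and on the zero-width non-joiner. */
fun tokenize(text: String): List<String> =
    text.replace(ZWNJ, ' ')          // ZWNJ becomes an ordinary space
        .split(Regex("\\s+"))
        .filter { it.isNotEmpty() }

/** Tokenize, normalize each token, then remove stop words (given in normalized form). */
fun analyze(text: String, stopWords: Set<String> = emptySet()): List<String> =
    tokenize(text).map(::normalizeVariants).filter { it !in stopWords }

fun main() {
    // Two words; the first contains a ZWNJ between its prefix and stem.
    val input = "\u0645\u06CC\u200C\u062E\u0648\u0627\u0647\u0645 \u06A9\u062A\u0627\u0628"
    println(analyze(input))
}
```

Because stop-word removal runs after normalization in this sketch, the stop-word set passed to `analyze` should already use the normalized letter forms.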
Constructors
Builds an analyzer with the given stop words and stem exclusion set.
Builds an analyzer with the given stop words.
constructor()
Builds an analyzer with the default stop words: DEFAULT_STOPWORD_FILE.