Package-level declarations
Types
class BibleKoreanAnalyzer(userDict: UserDictionary? = BibleKoreanUserDictionary.instance, mode: KoreanTokenizer.DecompoundMode = KoreanTokenizer.DEFAULT_DECOMPOUND, stopTags: Set<POS.Tag> = KoreanPartOfSpeechStopFilter.DEFAULT_STOP_TAGS, outputUnknownUnigrams: Boolean = false, stopWords: Set<String> = setOf("의")) : Analyzer
Analyzer for Korean text based on morphological analysis. Optionally removes stop words; the default set contains only the possessive particle "의".
Minimal stop filter that removes any token whose term (already lowercased) is in the provided set.
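The set-membership check this filter performs can be illustrated with a minimal plain-Kotlin sketch. It is not the package's implementation: `removeStopTerms` is a hypothetical helper that works on a list of strings instead of a Lucene token stream, and it assumes the terms were lowercased earlier.

```kotlin
// Hypothetical helper, not part of the documented package.
// Illustrates set-based stop filtering over terms that are assumed
// to have been lowercased already; no lowercasing happens here.
fun removeStopTerms(terms: List<String>, stopWords: Set<String>): List<String> =
    terms.filterNot { it in stopWords } // drop every term found in the stop set

fun main() {
    val terms = listOf("하나님", "의", "말씀")
    // Uses the same default stop word as the analyzer above: "의".
    println(removeStopTerms(terms, setOf("의"))) // prints [하나님, 말씀]
}
```

Because the lookup is an exact match against the set, a term that differs only in case from a stop word would pass through, which is why the input is expected to be lowercased beforehand.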