Package-level declarations

Types

Link copied to clipboard
class BibleKoreanAnalyzer(userDict: UserDictionary? = BibleKoreanUserDictionary.instance, mode: KoreanTokenizer.DecompoundMode = KoreanTokenizer.DEFAULT_DECOMPOUND, stopTags: Set<POS.Tag> = KoreanPartOfSpeechStopFilter.DEFAULT_STOP_TAGS, outputUnknownUnigrams: Boolean = false, stopWords: Set<String> = setOf("의")) : Analyzer

Analyzer for Korean that uses morphological analysis. Adds optional stop words (default includes the possessive particle "의").

Link copied to clipboard
class SimpleStopFilter(input: TokenStream, stopWords: Set<String>) : TokenStream

Very small stop filter that removes tokens whose term (already lowercased) is in the provided set.