KoreanTokenizer

Tokenizer for Korean that uses morphological analysis.

Constructors

constructor()

Creates a new KoreanTokenizer with default parameters.

constructor(factory: AttributeFactory, userDictionary: UserDictionary?, mode: KoreanTokenizer.DecompoundMode, outputUnknownUnigrams: Boolean)

Creates a new KoreanTokenizer using the system and unknown dictionaries shipped with Lucene.

constructor(factory: AttributeFactory, userDictionary: UserDictionary?, mode: KoreanTokenizer.DecompoundMode, outputUnknownUnigrams: Boolean, discardPunctuation: Boolean)

Creates a new KoreanTokenizer using the system and unknown dictionaries shipped with Lucene, with explicit control over whether punctuation is discarded.

constructor(factory: AttributeFactory, systemDictionary: TokenInfoDictionary, unkDictionary: UnknownDictionary, connectionCosts: ConnectionCosts, userDictionary: UserDictionary?, mode: KoreanTokenizer.DecompoundMode, outputUnknownUnigrams: Boolean, discardPunctuation: Boolean)

Creates a new KoreanTokenizer with a caller-supplied system dictionary, unknown dictionary, and connection costs.

Types

object Companion
enum DecompoundMode

Decompound mode: this determines how the tokenizer handles POS.Type.COMPOUND, POS.Type.INFLECT and POS.Type.PREANALYSIS tokens.
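The effect of a decompound mode on a single compound can be sketched as follows. This is a minimal illustration, not the library's implementation: `Mode`, `CompoundEntry`, and `emit` are hypothetical stand-ins, the mode names NONE, DISCARD, and MIXED are assumed to match Lucene's Java enum, and the order of the emitted forms is arbitrary. INFLECT and PREANALYSIS handling is not modeled.

```kotlin
// Hypothetical stand-ins for illustration; these are not the library's own types.
enum class Mode { NONE, DISCARD, MIXED }

// A dictionary entry whose surface form can be split into parts.
data class CompoundEntry(val surface: String, val parts: List<String>)

// Returns the surface forms emitted for one compound entry under the given mode.
fun emit(entry: CompoundEntry, mode: Mode): List<String> = when (mode) {
    Mode.NONE -> listOf(entry.surface)                 // keep the compound whole
    Mode.DISCARD -> entry.parts                        // parts only, original dropped
    Mode.MIXED -> listOf(entry.surface) + entry.parts  // original plus its parts
}

fun main() {
    // Illustrative compound: "가곡역" split into "가곡" + "역".
    val entry = CompoundEntry("가곡역", listOf("가곡", "역"))
    for (mode in Mode.values()) {
        println("$mode -> ${emit(entry, mode)}")
    }
}
```

MIXED is the mode to reach for when both the whole compound and its parts should be searchable; DISCARD keeps the index smaller at the cost of losing the original form.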

Properties

Functions

fun <T : Attribute> addAttribute(attClass: KClass<T>): T
open override fun close()
fun copyTo(target: AttributeSource)
open override fun end()
open operator override fun equals(obj: Any?): Boolean
fun <T : Attribute> getAttribute(attClass: KClass<T>): T?
fun hasAttribute(attClass: KClass<out Attribute>): Boolean
open override fun hashCode(): Int
open override fun incrementToken(): Boolean
fun reflectAsString(prependAttClass: Boolean): String
open override fun reset()

Expert: set this to produce Graphviz (dot) output of the Viterbi lattice.
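For readers unfamiliar with the term, a Viterbi lattice is a graph of candidate tokens over the input, and the tokenizer picks the cheapest path through it. The sketch below is a simplified illustration only: `Edge`, `bestPath`, and `toDot` are hypothetical names, the costs are made up, connection costs between adjacent tokens are left out, and the dot text it prints is not claimed to match what the library produces.

```kotlin
// Hypothetical lattice edge: a candidate token covering text[start, end) with a word cost.
data class Edge(val start: Int, val end: Int, val surface: String, val cost: Int)

// Finds the minimum-cost sequence of edges covering positions 0..length.
// Assumes start < end for every edge; connection costs are omitted for brevity.
fun bestPath(length: Int, edges: List<Edge>): List<Edge> {
    val best = IntArray(length + 1) { Int.MAX_VALUE }  // cheapest cost to reach each position
    val back = arrayOfNulls<Edge>(length + 1)          // edge used to reach each position
    best[0] = 0
    // Visiting edges by start offset means best[start] is final when it is read.
    for (e in edges.sortedBy { it.start }) {
        if (best[e.start] == Int.MAX_VALUE) continue   // start position is unreachable
        val total = best[e.start] + e.cost
        if (total < best[e.end]) {
            best[e.end] = total
            back[e.end] = e
        }
    }
    // Walk the back pointers from the end of the text to recover the path.
    val path = mutableListOf<Edge>()
    var pos = length
    while (pos > 0) {
        val e = back[pos] ?: return emptyList()        // no full segmentation exists
        path.add(e)
        pos = e.start
    }
    return path.reversed()
}

// Renders the lattice as Graphviz dot text, highlighting the chosen path.
fun toDot(edges: List<Edge>, best: List<Edge>): String = buildString {
    appendLine("digraph lattice {")
    for (e in edges) {
        val style = if (e in best) " color=red" else ""
        appendLine("  ${e.start} -> ${e.end} [label=\"${e.surface} (${e.cost})\"$style];")
    }
    append("}")
}

fun main() {
    // Candidate tokens over the three-character text "한국어", with invented costs.
    val edges = listOf(
        Edge(0, 1, "한", 700), Edge(0, 2, "한국", 400), Edge(0, 3, "한국어", 900),
        Edge(1, 3, "국어", 500), Edge(2, 3, "어", 200),
    )
    val best = bestPath(3, edges)
    println(best.map { it.surface })  // [한국, 어]
    println(toDot(edges, best))
}
```

The printed dot text can be pasted into any Graphviz renderer to see which candidates were considered and which path won.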

fun setReader(input: Reader)
open override fun toString(): String
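The lifecycle functions listed above are normally called in a fixed order: setReader, reset, repeated incrementToken until it returns false, then end and close. The sketch below shows that order against a hypothetical stand-in, `SketchTokenizer`, so that it runs without the library. The stand-in splits on whitespace, takes a String where the real setReader takes a Reader, exposes the current term as a plain property instead of an attribute, and performs no morphological analysis.

```kotlin
// Hypothetical stand-in that mirrors the lifecycle methods listed above.
// It splits on whitespace only; it is not the library's tokenizer.
class SketchTokenizer {
    private var text = ""
    private var pos = 0

    // Plays the role of a term attribute: holds the current token's text.
    var term = ""
        private set

    fun setReader(input: String) { text = input }  // the real method takes a Reader
    fun reset() { pos = 0 }

    // Advances to the next token; returns false once the input is exhausted.
    fun incrementToken(): Boolean {
        while (pos < text.length && text[pos].isWhitespace()) pos++
        if (pos >= text.length) return false
        val start = pos
        while (pos < text.length && !text[pos].isWhitespace()) pos++
        term = text.substring(start, pos)
        return true
    }

    fun end() { /* nothing to finalize in this stand-in */ }
    fun close() { text = "" }
}

// Drives the stand-in through the full lifecycle and collects every term.
fun collectTerms(tokenizer: SketchTokenizer, input: String): List<String> {
    val terms = mutableListOf<String>()
    tokenizer.setReader(input)
    tokenizer.reset()
    try {
        while (tokenizer.incrementToken()) terms += tokenizer.term
        tokenizer.end()
    } finally {
        tokenizer.close()  // always release the input, even on failure
    }
    return terms
}

fun main() {
    println(collectTerms(SketchTokenizer(), "한국어 형태소 분석"))
}
```

Keeping close in a finally block means the same tokenizer instance can be handed a new input afterwards, even when consumption stopped early.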