Package-level declarations

Types

abstract class AbstractDictionary

SmartChineseAnalyzer abstract dictionary implementation.

SmartChineseAnalyzer Bigram dictionary.

class BiSegGraph(segGraph: SegGraph)

Graph representing possible token pairs (bigrams) at each start offset in the sentence.

Finds the optimal segmentation of a sentence into Chinese words.
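
As a rough illustration of what "optimal segmentation" can mean, the sketch below picks the lowest-cost split of a sentence with a dynamic program over character offsets. It is not the segmenter's actual algorithm: `segment`, the `cost` map, and `unknownCost` are hypothetical names, and the per-word costs are made-up values rather than dictionary data.

```kotlin
// Hypothetical sketch: lowest-cost segmentation using per-word costs only.
// `cost` maps a known word to an assumed cost (lower is better).
fun segment(
    sentence: String,
    cost: Map<String, Double>,
    unknownCost: Double = 10.0, // assumed penalty for a single unknown character
): List<String> {
    val n = sentence.length
    val best = DoubleArray(n + 1) { Double.POSITIVE_INFINITY } // best[i]: cheapest split of the first i chars
    val prev = IntArray(n + 1) { -1 }                           // prev[i]: start offset of the last word in that split
    best[0] = 0.0
    for (end in 1..n) {
        for (start in 0 until end) {
            val word = sentence.substring(start, end)
            // Single characters are always allowed, so every sentence has at least one path.
            val c = cost[word] ?: if (end - start == 1) unknownCost else continue
            if (best[start] + c < best[end]) {
                best[end] = best[start] + c
                prev[end] = start
            }
        }
    }
    // Walk the back-pointers from the end of the sentence to recover the words.
    val words = mutableListOf<String>()
    var end = n
    while (end > 0) {
        val start = prev[end]
        words.add(sentence.substring(start, end))
        end = start
    }
    return words.reversed()
}
```

With the assumed costs `研究` = 2.0, `研究生` = 3.0, `生命` = 2.0, `生` = 4.0, `命` = 4.0 and `起源` = 2.0, the sentence `研究生命起源` splits into `研究`, `生命`, `起源` (total cost 6.0) rather than `研究生`, `命`, `起源` (total cost 9.0).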

SmartChineseAnalyzer internal node representation.

class SegGraph

Graph representing possible tokens at each start offset in the sentence.
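
To make the "tokens at each start offset" idea concrete, here is a minimal sketch of such a structure. `Token` and `OffsetGraph` are hypothetical stand-ins invented for this illustration; they are not the `SegToken` and `SegGraph` classes and do not mirror their members.

```kotlin
// Hypothetical simplified token: only the fields this illustration needs.
data class Token(val text: String, val startOffset: Int, val endOffset: Int)

// Hypothetical graph: candidate tokens grouped by the offset at which they start.
class OffsetGraph {
    private val tokensByStart = mutableMapOf<Int, MutableList<Token>>()

    // Register a candidate token under its start offset.
    fun add(token: Token) {
        tokensByStart.getOrPut(token.startOffset) { mutableListOf() }.add(token)
    }

    // All candidate tokens beginning at the given offset (empty if none).
    fun startingAt(offset: Int): List<Token> = tokensByStart[offset].orEmpty()

    // Every token, ordered by start offset, then by insertion order.
    fun toList(): List<Token> =
        tokensByStart.keys.sorted().flatMap { tokensByStart.getValue(it) }
}
```

Overlapping candidates simply share a start offset: adding both `研究` (0 to 2) and `研究生` (0 to 3) leaves two entries under offset 0, and choosing between them is left to whatever walks the graph.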

class SegToken(var charArray: CharArray, var startOffset: Int, var endOffset: Int, var wordType: Int, var weight: Int)

SmartChineseAnalyzer internal token.

Filters a SegToken by converting full-width Latin characters to half-width and then lowercasing them. Additionally, all punctuation is converted into Utility.COMMON_DELIMITER.
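
The normalization described above can be sketched on plain characters as follows. This is an illustration only, not the filter's implementation: `DELIMITER` is a hypothetical stand-in for `Utility.COMMON_DELIMITER` (whose value is not shown on this page), and treating every character that is neither a letter, a digit, nor whitespace as punctuation is a simplifying assumption.

```kotlin
// Hypothetical stand-in for Utility.COMMON_DELIMITER; the real value is not shown here.
const val DELIMITER = ','

fun normalizeChar(c: Char): Char {
    // Full-width ASCII variants occupy U+FF01..U+FF5E and map onto
    // U+0021..U+007E by subtracting a fixed offset of 0xFEE0.
    val half = if (c in '\uFF01'..'\uFF5E') (c.code - 0xFEE0).toChar() else c
    return when {
        half in 'A'..'Z' -> half.lowercaseChar() // lowercase Latin letters
        half.isLetterOrDigit() -> half           // keep other letters (including CJK) and digits
        half.isWhitespace() -> half              // assumption: whitespace is left alone
        else -> DELIMITER                        // assumption: everything else counts as punctuation
    }
}

fun normalize(text: String): String = text.map(::normalizeChar).joinToString("")
```

Under these assumptions, `normalize("\uFF21\uFF22\uFF43\uFF11\uFF01")` (full-width `ＡＢｃ１！`) returns `"abc1,"`.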

class SegTokenPair(var charArray: CharArray, var from: Int, var to: Int, var weight: Double)

A pair of tokens in SegGraph.

SmartChineseAnalyzer Word Dictionary.