Package-level declarations
Types
Link copied to clipboard
SmartChineseAnalyzer abstract dictionary implementation.
Link copied to clipboard
SmartChineseAnalyzer Bigram dictionary.
Link copied to clipboard
Graph representing possible token pairs (bigrams) at each start offset in the sentence.
Link copied to clipboard
class HHMMSegmenter
Finds the optimal segmentation of a sentence into Chinese words
Link copied to clipboard
SmartChineseAnalyzer internal node representation
Link copied to clipboard
class SegTokenFilter
Filters a SegToken by converting full-width latin to half-width, then lowercasing latin. Additionally, all punctuation is converted into Utility.COMMON_DELIMITER
Link copied to clipboard
A pair of tokens in SegGraph
Link copied to clipboard
SmartChineseAnalyzer Word Dictionary