Internal token used by SmartChineseAnalyzer
Character array containing token text
End offset into the original sentence
During segmentation, stores the index of this token in the token list table
Start offset into the original sentence
Word frequency
WordType of the text
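
The fields described above can be pictured as one small value class. The following is only a sketch under stated assumptions: the class name SketchSegToken, its field names, the constant WORD_TYPE_CHINESE_WORD, the -1 placeholder for an unassigned index, and the exclusive end offset are all hypothetical choices made for illustration, not the analyzer's actual API.

```java
// Hypothetical sketch of a segmentation token. All names here are
// illustrative assumptions, not the analyzer's real class or constants.
final class SketchSegToken {
    // Assumed word-type code; the real analyzer defines its own values.
    static final int WORD_TYPE_CHINESE_WORD = 1;

    final char[] charArray; // character array containing the token text
    final int startOffset;  // start offset into the original sentence
    final int endOffset;    // end offset into the original sentence (assumed exclusive)
    final int wordType;     // word type of the text
    int weight;             // word frequency
    int index = -1;         // index in the token list; -1 until segmentation assigns it

    SketchSegToken(char[] charArray, int startOffset, int endOffset,
                   int wordType, int weight) {
        if (startOffset < 0 || endOffset < startOffset) {
            throw new IllegalArgumentException("invalid offsets");
        }
        this.charArray = charArray.clone(); // defensive copy of the caller's array
        this.startOffset = startOffset;
        this.endOffset = endOffset;
        this.wordType = wordType;
        this.weight = weight;
    }

    // Number of sentence positions covered, given an exclusive end offset.
    int length() {
        return endOffset - startOffset;
    }

    // Token text as a String, built from the stored characters.
    String text() {
        return new String(charArray);
    }

    public static void main(String[] args) {
        SketchSegToken token = new SketchSegToken(
                "ab".toCharArray(), 3, 5, WORD_TYPE_CHINESE_WORD, 42);
        token.index = 0; // a segmenter would assign this while building its token list
        System.out.println(token.text() + " [" + token.startOffset + ","
                + token.endOffset + ") weight=" + token.weight
                + " index=" + token.index);
    }
}
```

Keeping index mutable while the text and offsets stay final reflects the description above: the index is filled in during segmentation, whereas the text and its position in the sentence are fixed once the token is created.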