WordDelimiterIterator
class WordDelimiterIterator(charTypeTable: ByteArray, val splitOnCaseChange: Boolean, val splitOnNumerics: Boolean, val stemEnglishPossessive: Boolean)
A BreakIterator-like API for iterating over subwords in text, according to WordDelimiterGraphFilter rules.
Properties
Link copied to clipboard
If false, causes case changes to be ignored (subwords will only be generated given SUBWORD_DELIM tokens). (Defaults to true)
Link copied to clipboard
If false, causes numeric changes to be ignored (subwords will only be generated given SUBWORD_DELIM tokens). (Defaults to true)
Link copied to clipboard
start position of text, excluding leading delimiters
Link copied to clipboard
If true, causes trailing "'s" to be removed for each subword. (Defaults to true)