KhmerAnalyzer
Analyzer for Khmer text.
Tokenizes text into grapheme clusters using GraphemeClusterTokenizer, optionally applies character-level normalization via KhmerNormalizationCharFilter before tokenization, and then reorders characters within each token using CharReorderFilter.
Parameters
normalizationlevel
normalization level: 0 = none, 1 = formally confusable (default), 2 = also informally confusable, 3 = also digit mapping and more