CJKBigramFilter

constructor(input: TokenStream)

Calls CJKBigramFilter with default flags.


constructor(input: TokenStream, flags: Int)

Calls CJKBigramFilter with outputUnigrams set to false.


constructor(input: TokenStream, flags: Int, outputUnigrams: Boolean)

Create a new CJKBigramFilter, specifying which writing systems should be bigrammed, and whether or not unigrams should also be output.

Parameters

flags

OR'ed set from HAN, HIRAGANA, KATAKANA, HANGUL

outputUnigrams

true if unigrams for the selected writing systems should also be output. when this is false, this is only done when there are no adjacent characters to form a bigram.