WhitespaceTokenizerFactory
Factory for WhitespaceTokenizer.
<fieldType name="text_ws" class="solr.TextField" positionIncrementGap="100">
<analyzer>
<tokenizer class="solr.WhitespaceTokenizerFactory" rule="unicode" maxTokenLen="256"/>
</analyzer>
</fieldType>

Options:
- rule: either "java" for [WhitespaceTokenizer] or "unicode" for [UnicodeWhitespaceTokenizer]
- maxTokenLen: maximum token length; must be greater than 0 and less than MAX_TOKEN_LENGTH_LIMIT (1024*1024). It is rarely necessary to change this from the default, [CharTokenizer]::DEFAULT_MAX_TOKEN_LEN.
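
The practical difference between the two rule values can be sketched with the JDK alone. The example below is an approximation, not Lucene code: it assumes that rule="java" separates tokens wherever Character.isWhitespace returns true and that rule="unicode" follows the Unicode White_Space property. The class WhitespaceRuleSketch and its methods are hypothetical names introduced only for this illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.IntPredicate;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Lucene: mimics the two whitespace rules with JDK calls only.
class WhitespaceRuleSketch {

    // Unicode White_Space binary property as exposed by java.util.regex.
    private static final Pattern UNICODE_WS = Pattern.compile("\\p{IsWhite_Space}");

    // Assumed behavior of rule="java": split on Character.isWhitespace(int).
    static List<String> splitJava(String text) {
        return split(text, Character::isWhitespace);
    }

    // Assumed behavior of rule="unicode": split on the Unicode White_Space property.
    static List<String> splitUnicode(String text) {
        return split(text, cp -> UNICODE_WS.matcher(new String(Character.toChars(cp))).matches());
    }

    // Walks the text code point by code point and cuts a token at every separator.
    private static List<String> split(String text, IntPredicate isSeparator) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            i += Character.charCount(cp);
            if (isSeparator.test(cp)) {
                if (current.length() > 0) {
                    tokens.add(current.toString());
                    current.setLength(0);
                }
            } else {
                current.appendCodePoint(cp);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        // U+00A0 (no-break space) is excluded by Character.isWhitespace
        // but carries the Unicode White_Space property.
        String text = "foo\u00A0bar baz";
        System.out.println(splitJava(text).size());    // prints 2
        System.out.println(splitUnicode(text).size()); // prints 3
    }
}
```

Under these assumptions, input containing no-break spaces is where the two rules diverge; for plain ASCII spaces, tabs, and newlines they produce the same tokens.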
Since: 3.1