Limit the default token length.
Acts similar to KeywordTokenizer
Acts like LetterTokenizer (partial Unicode :Letter:)
Acts similar to WhitespaceTokenizer