CharacterUtils
object CharacterUtils
Utility class to write tokenizers or token filters.
Types
Link copied to clipboard
class CharacterBuffer
A simple IO buffer to use with CharacterUtils.fill.
Functions
Link copied to clipboard
fun fill(buffer: CharacterUtils.CharacterBuffer, reader: Reader, numChars: Int = buffer.buffer.size): Boolean
Convenience method which calls fill(buffer, reader, buffer.buffer.length).
Link copied to clipboard
Creates a new CharacterBuffer and allocates a char[] of the given bufferSize.
Link copied to clipboard
Link copied to clipboard
Converts each unicode codepoint to lowerCase via Character.toLowerCase starting at the given offset.
Link copied to clipboard
Converts each unicode codepoint to UpperCase via Character.toUpperCase starting at the given offset.