Package-level declarations

Types

Link copied to clipboard
abstract class BinaryDictionary<T : MorphData>(targetMapResource: IOSupplier<InputStream>, dictResource: IOSupplier<InputStream>, targetMapCodecHeader: String, dictCodecHeader: String, dictCodecVersion: Int) : Dictionary<T>

Abstract dictionary base class.

Link copied to clipboard
abstract class BinaryDictionaryWriter<T : BinaryDictionary<out MorphData>>(implClazz: KClass<T>, entryWriter: DictionaryEntryWriter)

Abstract base dictionary writer class.

Link copied to clipboard
abstract class CharacterDefinition(charDefResource: IOSupplier<InputStream>, charDefCodecHeader: String, charDefCodecVersion: Int, classCount: Int)

Character category data.

Link copied to clipboard
class CharacterDefinitionWriter<T : CharacterDefinition>(implClazz: KClass<T>, defaultValue: Byte, classCount: Int, lookupCharClass: CharacterDefinition.LookupCharacterClass)

Writes character definition file

Link copied to clipboard
abstract class ConnectionCosts(connectionCostResource: IOSupplier<InputStream>, connectionCostsCodecHeader: String, dictCodecVersion: Int)

n-gram connection cost data

Link copied to clipboard
class ConnectionCostsWriter<T : ConnectionCosts>(implClazz: KClass<T>, forwardSize: Int, backwardSize: Int)

Writes connection costs

Link copied to clipboard
interface Dictionary<T : MorphData>

High-level dictionary interface for morphological analyzers.

Link copied to clipboard
abstract class DictionaryEntryWriter(size: Int)

Abstract writer class to write dictionary entries.

Link copied to clipboard

Outputs the dot (graphviz) string for the viterbi lattice.

Link copied to clipboard
interface MorphData

High-level interface that represents morphological information in a dictionary

Link copied to clipboard
abstract class Token(val surfaceForm: CharArray, val offset: Int, val length: Int, val startOffset: Int, val endOffset: Int, val type: TokenType)

Analyzed token with morphological data.

Link copied to clipboard
abstract class TokenInfoFST(fst: FST<Long>, cacheCeiling: Int, cacheFloor: Int)

Thin wrapper around an FST with root-arc caching.

Link copied to clipboard

Token type reflecting the original source of this token

Link copied to clipboard
abstract class Viterbi<T : Token, U : Viterbi.Position>(fst: TokenInfoFST, fstReader: FST.BytesReader, dictionary: BinaryDictionary<out MorphData>, userFST: TokenInfoFST?, userFSTReader: FST.BytesReader?, userDictionary: Dictionary<out MorphData>?, costs: ConnectionCosts, positionFactory: () -> U)

Performs Viterbi algorithm for morphological Tokenizers.

Link copied to clipboard
abstract class ViterbiNBest<T : Token, U : MorphData>(fst: TokenInfoFST, fstReader: FST.BytesReader, dictionary: BinaryDictionary<out MorphData>, userFST: TokenInfoFST?, userFSTReader: FST.BytesReader?, userDictionary: Dictionary<out MorphData>?, costs: ConnectionCosts) : Viterbi<T, ViterbiNBest.PositionNBest>

Viterbi subclass for n-best path calculation.