Package-level declarations

Types

Link copied to clipboard
Link copied to clipboard
class Dictionary(val fsa: FSA, val metadata: DictionaryMetadata)

A dictionary combines FSA automaton and DictionaryMetadata describing the way terms are encoded in the automaton.

Link copied to clipboard
Link copied to clipboard
class DictionaryIterator(dictionary: Dictionary, decoder: CharsetDecoder, decodeStems: Boolean) : Iterator<WordData>

An iterator over WordData entries of a Dictionary.

Link copied to clipboard

Dictionary lookup of an inflected word over a dictionary.

Link copied to clipboard

Description of attributes, their types and default values.

Link copied to clipboard

Helper class to build DictionaryMetadata instances.

Link copied to clipboard
Link copied to clipboard

The logic of encoding one sequence of bytes relative to another sequence of bytes. The "base" form and the "derived" form are typically the stem of a word and the inflected form of a word.

Link copied to clipboard
interface IStemmer

A generic "stemmer" interface in Morfologik.

Link copied to clipboard

No relative encoding at all (full target form is returned).

Link copied to clipboard

Encodes dst relative to src by trimming whatever non-equal suffix and infix src and dst have.

Link copied to clipboard

Encodes dst relative to src by trimming whatever non-equal suffix and prefix src and dst have.

Link copied to clipboard

Encodes dst relative to src by trimming whatever non-equal suffix src has.

Link copied to clipboard

Thrown when some input cannot be mapped using the declared charset (bytes to characters or the other way around).

Link copied to clipboard

Stem and tag data associated with a given word.