A B C D E F G H I L M N O P R S T U V W

A

arcs - Variable in class morfologik.fsa.FSAVer5Impl
An array of bytes with the internal representation of the automaton.
arcSize - Variable in class morfologik.fsa.FSAVer5Impl
Size of a single arc (in bytes).
Arrays - Class in morfologik.util
Compatibility layer for JVM 1.5.
assertExists(File, boolean, boolean) - Static method in class morfologik.util.FileUtils
Checks if the given file exists.
ATTR_NAME_ENCODING - Static variable in class morfologik.stemming.DictionaryMetadata
Attribute name for DictionaryMetadata.encoding.
ATTR_NAME_SEPARATOR - Static variable in class morfologik.stemming.DictionaryMetadata
Attribute name for DictionaryMetadata.separator.
ATTR_NAME_USES_INFIXES - Static variable in class morfologik.stemming.DictionaryMetadata
Attribute name for DictionaryMetadata.usesInfixes.
ATTR_NAME_USES_PREFIXES - Static variable in class morfologik.stemming.DictionaryMetadata
Attribute name for DictionaryMetadata.usesPrefixes.

B

bits - Variable in enum morfologik.fsa.FSAFlags
Bit mask for the corresponding flag.
BufferUtils - Class in morfologik.util
Utility functions for buffers.

C

clone() - Method in class morfologik.stemming.WordData
Declare a covariant of Object.clone() that returns a deep copy of this object.
close(Closeable...) - Static method in class morfologik.util.FileUtils
Force any non-null closeables.
copyOf(byte[], int) - Static method in class morfologik.util.Arrays
 
copyOf(int[], int) - Static method in class morfologik.util.Arrays
 
copyOf(T[], int) - Static method in class morfologik.util.Arrays
 
copyOf(U[], int, Class<? extends T[]>) - Static method in class morfologik.util.Arrays
 

D

decodeStem(ByteBuffer, byte[], int, ByteBuffer, DictionaryMetadata) - Static method in class morfologik.stemming.DictionaryLookup
Decode the base form of an inflected word and save its decoded form into a byte buffer.
defaultDictionaries - Static variable in class morfologik.stemming.Dictionary
Default loaded dictionaries.
Dictionary - Class in morfologik.stemming
A dictionary combines FSA automaton and metadata describing the internals of dictionary entries' coding (DictionaryMetadata.
Dictionary(FSA, DictionaryMetadata) - Constructor for class morfologik.stemming.Dictionary
It is strongly recommended to use static methods in this class for reading dictionaries.
DictionaryIterator - Class in morfologik.stemming
An iterator over WordData entries of a Dictionary.
DictionaryIterator(Dictionary, CharsetDecoder, boolean) - Constructor for class morfologik.stemming.DictionaryIterator
 
DictionaryLookup - Class in morfologik.stemming
This class implements a dictionary lookup over an FSA dictionary.
DictionaryLookup(Dictionary) - Constructor for class morfologik.stemming.DictionaryLookup
Creates a new object of this class using the given FSA for word lookups and encoding for converting characters to bytes.
DictionaryMetadata - Class in morfologik.stemming
Description of attributes, their types and default values.
DictionaryMetadata(char, String, boolean, boolean, Map<String, String>) - Constructor for class morfologik.stemming.DictionaryMetadata
Creates an immutable instance of DictionaryMetadata.
DumpTool - Class in morfologik.tools
This utility will dump the information and contents of a given FSA dictionary.
DumpTool() - Constructor for class morfologik.tools.DumpTool
 

E

elemsPerSecond(int) - Method in class morfologik.util.PerformanceTimer
 
encoding - Variable in class morfologik.stemming.DictionaryMetadata
Encoding used for converting bytes to characters and vice versa.
ensureCapacity(ByteBuffer, int) - Static method in class morfologik.util.BufferUtils
Ensure the byte buffer's capacity.
ensureCapacity(CharBuffer, int) - Static method in class morfologik.util.BufferUtils
Ensure the char buffer's capacity.
equals(Object) - Method in class morfologik.stemming.WordData
 

F

FileUtils - Class in morfologik.util
Utility functions.
filler - Variable in class morfologik.fsa.FSA
The meaning of this field is not clear (check the FSA documentation).
flagsToString(int) - Static method in class morfologik.fsa.FSAHelpers
Converts an integer with FSA flags to a human-readable string.
FSA - Class in morfologik.fsa
This class implements Finite State Automaton traversal as described in Jan Daciuk's Incremental Construction of Finite-State Automata and Transducers, and Their Use in the Natural Language Processing (PhD thesis, Technical University of Gdansk).
FSA(InputStream, String) - Constructor for class morfologik.fsa.FSA
Creates a new automaton reading the FSA automaton from an input stream.
fsa - Variable in class morfologik.stemming.Dictionary
FSA automaton with the compiled dictionary data.
FSAFinalStatesIterator - Class in morfologik.fsa
An iterator that traverses all final states reachable from a given node and returns byte sequences corresponding to final states.
FSAFlags - Enum in morfologik.fsa
FSA automaton flags.
FSAHelpers - Class in morfologik.fsa
This class has several static utility methods for use with the FSA package.
FSAMatch - Class in morfologik.fsa
A matching result returned from FSATraversalHelper.
FSAMatch() - Constructor for class morfologik.fsa.FSAMatch
 
FSAMatchType - Enum in morfologik.fsa
Type of the match returned as part of FSAMatch.
FSATraversalHelper - Class in morfologik.fsa
This class implements some common matching and scanning operations on a generic FSA.
FSAVer5Impl - Class in morfologik.fsa
FSA (Finite State Automaton) dictionary traversal implementation for version 5 of the FSA automaton.
FSAVer5Impl(InputStream, String) - Constructor for class morfologik.fsa.FSAVer5Impl
Creates a new automaton reading it from a file in FSA format, version 5.

G

getAllSubsequences(int) - Method in class morfologik.fsa.FSATraversalHelper
Returns an Iterator of all subsequences available from the given node to all reachable final states.
getAnnotationSeparator() - Method in class morfologik.fsa.FSA
Return the annotation separator character, converted to a character according to the encoding scheme passed in in the constructor of this class.
getArc(int, byte) - Method in class morfologik.fsa.FSA
Returns the identifier of an arc leaving node and labeled with label.
getArc(int, byte) - Method in class morfologik.fsa.FSAVer5Impl
 
getArcLabel(int) - Method in class morfologik.fsa.FSA
Return the label associated with a given arc.
getArcLabel(int) - Method in class morfologik.fsa.FSAVer5Impl
 
getDictionary() - Method in class morfologik.stemming.DictionaryLookup
 
getEndNode(int) - Method in class morfologik.fsa.FSA
Return the end node pointed to by a given arc.
getEndNode(int) - Method in class morfologik.fsa.FSAVer5Impl
 
getExpectedFeaturesName(String) - Static method in class morfologik.stemming.Dictionary
Returns the expected name of the metadata file, based on the name of the FSA dictionary file.
getFillerCharacter() - Method in class morfologik.fsa.FSA
Return the filler character, converted to a character according to the encoding scheme passed in in the constructor of this class.
getFinalStatesIterator() - Method in class morfologik.fsa.FSATraversalHelper
Returns a new iterator for walking along the final states of this FSA.
getFirstArc(int) - Method in class morfologik.fsa.FSA
Returns the identifier of the first arc leaving node or 0 if the node has no outgoing arcs.
getFirstArc(int) - Method in class morfologik.fsa.FSAVer5Impl
 
getFlags() - Method in class morfologik.fsa.FSA
Returns a set of flags for this FSA instance.
getFlags(int) - Static method in class morfologik.fsa.FSAHelpers
Returns flags as an integer for a given version number.
getForLanguage(String) - Static method in class morfologik.stemming.Dictionary
Return a built-in dictionary for a given ISO language code.
getInstance(File, String) - Static method in class morfologik.fsa.FSA
This static method will attempt to instantiate an appropriate implementation of the FSA for the version found in file given in the input argument.
getInstance(InputStream, String) - Static method in class morfologik.fsa.FSA
This static method will attempt to instantiate an appropriate implementation of the FSA for the version found in file given in the input argument.
getMatchType() - Method in class morfologik.fsa.FSAMatch
Return match type.
getMismatchIndex() - Method in class morfologik.fsa.FSAMatch
Return the index at which a mismatch occurred.
getMismatchNode() - Method in class morfologik.fsa.FSAMatch
Return the node at which mismatch occurred.
getNextArc(int, int) - Method in class morfologik.fsa.FSA
Returns the identifier of the next arc after arc and leaving node.
getNextArc(int, int) - Method in class morfologik.fsa.FSAVer5Impl
 
getNumberOfArcs() - Method in class morfologik.fsa.FSA
Returns the number of arcs in this automaton.
getNumberOfArcs() - Method in class morfologik.fsa.FSAVer5Impl
Returns the number of arcs in this automaton.
getNumberOfNodes() - Method in class morfologik.fsa.FSA
Returns the number of nodes in this automaton.
getNumberOfNodes() - Method in class morfologik.fsa.FSAVer5Impl
Returns the number of nodes in this automaton.
getRootNode() - Method in class morfologik.fsa.FSA
Returns the identifier of the root node of this automaton.
getRootNode() - Method in class morfologik.fsa.FSAVer5Impl
Returns the start node of this automaton.
getStem() - Method in class morfologik.stemming.WordData
 
getStemBytes(ByteBuffer) - Method in class morfologik.stemming.WordData
Copy the stem's binary data (no charset decoding) to a custom byte buffer.
getTag() - Method in class morfologik.stemming.WordData
 
getTagBytes(ByteBuffer) - Method in class morfologik.stemming.WordData
Copy the tag's binary data (no charset decoding) to a custom byte buffer.
getTraversalHelper() - Method in class morfologik.fsa.FSA
Returns an object which can be used to walk the edges of this finite state automaton and match arbitrary sequences against its states.
getVersion() - Method in class morfologik.fsa.FSA
Returns the version number of the binary representation of this FSA.
getVersion(int) - Static method in class morfologik.fsa.FSAHelpers
Returns a version number for a set of flags.
getWord() - Method in class morfologik.stemming.WordData
 
getWordBytes(ByteBuffer) - Method in class morfologik.stemming.WordData
Copy the inflected word's binary data (no charset decoding) to a custom byte buffer.
go(CommandLine) - Method in class morfologik.tools.DumpTool
Command line entry point after parsing arguments.
gotoLength - Variable in class morfologik.fsa.FSA
Size of transition's destination node "address".
gotoOffset - Static variable in class morfologik.fsa.FSAVer5Impl
An offset in the arc structure, where the address field begins.

H

hashCode() - Method in class morfologik.stemming.WordData
 
hasNext() - Method in class morfologik.fsa.FSAFinalStatesIterator
Returns true if there are still elements in this iterator.
hasNext() - Method in class morfologik.stemming.DictionaryIterator
 

I

inflectionFrames() - Method in class morfologik.tools.InflectionFramesTool
 
InflectionFramesTool - Class in morfologik.tools
Calculate inflection frames from the Polish dictionary.
InflectionFramesTool() - Constructor for class morfologik.tools.InflectionFramesTool
 
initializeOptions(Options) - Method in class morfologik.tools.DumpTool
Command line options for the tool.
isArcFinal(int) - Method in class morfologik.fsa.FSA
Returns true if the destination node at the end of this arc corresponds to an input sequence created when building this automaton.
isArcFinal(int) - Method in class morfologik.fsa.FSAVer5Impl
 
isArcTerminal(int) - Method in class morfologik.fsa.FSA
Returns true if this arc does not have a terminating node.
isArcTerminal(int) - Method in class morfologik.fsa.FSAVer5Impl
 
isSet(int, FSAFlags) - Static method in enum morfologik.fsa.FSAFlags
Returns true if the corresponding flag is set in the bit set.
IStemmer - Interface in morfologik.stemming
A generic "stemmer" interface in Morfologik.
iterator() - Method in class morfologik.fsa.FSA
Returns an iterator over all binary sequences starting from the initial FSA state and ending in final nodes.
iterator() - Method in class morfologik.stemming.DictionaryLookup
Return an iterator over all WordData entries available in the embedded Dictionary.
iterator() - Method in class morfologik.stemming.PolishStemmer
Iterates over all dictionary forms stored in this stemmer.

L

Launcher - Class in morfologik.tools
A launcher for other command-line tools.
Launcher() - Constructor for class morfologik.tools.Launcher
 
lookup(CharSequence) - Method in class morfologik.stemming.DictionaryLookup
Searches the automaton for a symbol sequence equal to word, followed by a separator.
lookup(CharSequence) - Method in interface morfologik.stemming.IStemmer
Returns a list of WordData entries for a given word.
lookup(CharSequence) - Method in class morfologik.stemming.PolishStemmer
Returns a list of WordData entries for a given word.

M

main(String[]) - Static method in class morfologik.tools.DumpTool
Command line entry point.
main(String[]) - Static method in class morfologik.tools.InflectionFramesTool
 
main(String[]) - Static method in class morfologik.tools.Launcher
Command line entry point.
matchSequence(FSAMatch, byte[], int, int, int) - Method in class morfologik.fsa.FSATraversalHelper
Same as FSATraversalHelper.matchSequence(byte[], int, int, int), but allows passing a reusable FSAMatch object so that no intermediate garbage is produced.
matchSequence(byte[], int, int, int) - Method in class morfologik.fsa.FSATraversalHelper
Finds a matching path in the dictionary for a given sequence of labels from sequence and starting at node node.
matchSequence(byte[], int) - Method in class morfologik.fsa.FSATraversalHelper
 
matchSequence(byte[]) - Method in class morfologik.fsa.FSATraversalHelper
 
metadata - Variable in class morfologik.stemming.Dictionary
Metadata associated with the dictionary.
metadata - Variable in class morfologik.stemming.DictionaryMetadata
Other meta data not included above.
METADATA_FILE_EXTENSION - Static variable in class morfologik.stemming.Dictionary
Expected metadata file extension.
morfologik.fsa - package morfologik.fsa
 
morfologik.stemming - package morfologik.stemming
 
morfologik.tools - package morfologik.tools
 
morfologik.util - package morfologik.util
 

N

next() - Method in class morfologik.fsa.FSAFinalStatesIterator
 
next() - Method in class morfologik.stemming.DictionaryIterator
 

O

openInputStream(String) - Static method in class morfologik.util.ResourceUtils
Returns an input stream to the resource.

P

PerformanceTimer - Class in morfologik.util
Simple, simple performance checking.
PerformanceTimer() - Constructor for class morfologik.util.PerformanceTimer
 
PolishStemmer - Class in morfologik.stemming
A dictionary-based stemmer for the Polish language.
PolishStemmer() - Constructor for class morfologik.stemming.PolishStemmer
This constructor is initialized with a built-in dictionary or fails with a runtime exception if the dictionary is not available.

R

read(File) - Static method in class morfologik.stemming.Dictionary
Attempts to load a dictionary using the path to the FSA file and the expected metadata extension.
read(URL) - Static method in class morfologik.stemming.Dictionary
Attempts to load a dictionary using the URL to the FSA file and the expected metadata extension.
readAndClose(InputStream, InputStream) - Static method in class morfologik.stemming.Dictionary
Attempts to load a dictionary from opened streams of FSA dictionary data and associated metadata.
readFully(InputStream) - Method in class morfologik.fsa.FSA
Reads all bytes from an input stream.
readHeader(DataInput, long) - Method in class morfologik.fsa.FSA
Reads a FSA header from a stream.
readHeader(DataInput, long) - Method in class morfologik.fsa.FSAVer5Impl
Reads a FSA header from a stream.
remove() - Method in class morfologik.fsa.FSAFinalStatesIterator
Not implemented in this iterator.
remove() - Method in class morfologik.stemming.DictionaryIterator
 
ResourceUtils - Class in morfologik.util
Resource management utilities.
restartFrom(int) - Method in class morfologik.fsa.FSAFinalStatesIterator
Restart walking from node.
run(Callable<Void>, int, int) - Static method in class morfologik.util.PerformanceTimer
Run the task with a given number of warm-up rounds and the given number of cycles.

S

separator - Variable in class morfologik.stemming.DictionaryMetadata
A separator character between fields (stem, lemma, form).

T

toString() - Method in class morfologik.util.PerformanceTimer
 

U

usesInfixes - Variable in class morfologik.stemming.DictionaryMetadata
True if the dictionary was compiled with infix compression.
usesPrefixes - Variable in class morfologik.stemming.DictionaryMetadata
True if the dictionary was compiled with prefix compression.

V

valueOf(String) - Static method in enum morfologik.fsa.FSAFlags
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum morfologik.fsa.FSAMatchType
Returns the enum constant of this type with the specified name.
values() - Static method in enum morfologik.fsa.FSAFlags
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum morfologik.fsa.FSAMatchType
Returns an array containing the constants of this enum type, in the order they are declared.
version - Variable in class morfologik.fsa.FSA
Dictionary version (derived from the combination of flags).
VERSION_5 - Static variable in class morfologik.fsa.FSA
Version number for version 5 of the automaton.

W

WordData - Class in morfologik.stemming
Stem and tag data associated with a given word.

A B C D E F G H I L M N O P R S T U V W