LatvianStemmer 1.0.2

Creator: bradpython12

Last updated:

Add to Cart

Description:

LatvianStemmer 1.0.2

LatvianStemmer
The original Java code can be found in https://github.com/apache/lucene-solr
Ported to Python by Rihards KriĊĦlauks with minor modifications
Light stemmer for Latvian.
This is a light version of the algorithm in Karlis Kreslin's PhD thesis A stemming algorithm for Latvian with the following modifications:

Only explicitly stems noun and adjective morphology
Stricter length/vowel checks for the resulting stems (verb etc suffix stripping is removed)
Removes only the primary inflectional suffixes: case and number for nouns case, number, gender, and definitiveness for adjectives.
Palatalization is only handled when a declension II,V,VI noun suffix is removed.

Usage
pip install LatvianStemmer
lvstemmer < input.txt > output.txt
# or
lvstemmer input1.txt input2.txt > output.txt

License

For personal and professional use. You cannot resell or redistribute these repositories in their original state.

Customer Reviews

There are no reviews.