Last updated:
0 purchases
pytypo 0.3.0
pytypo
pytypo corrects English spelling mistakes.
That feature is based on TYPO CORPUS (http://luululu.com/tweet/)
and Wikipedia (https://en.wikipedia.org/wiki/Wikipedia:Lists_of_common_misspellings/For_machines)
And this module normalizes also lengthened English expression having repeating letters.
(e.g., this module converts “cooooooooooooooollllllllllllll” to “cool”)
That feature is based on the following paper:
Samuel Brody and Nicholas Diakopoulos.
Cooooooooooooooollllllllllllll!!!!!!!!!!!!!! using word lengthening to detect sentiment in microblogs.
In EMNLP2011, pp. 562-570, 2011.
http://aclweb.org/anthology//D/D11/D11-1052.pdf
Contributions are welcome!
Installation
$ pip install pytypo
Usage
Import pytypo
>>> import pytypo
correct sentence
>>> pytypo.correct_sentence('you are coooolll!!!')
you are cool!
correct_sentence(str)
correct word
>>> pytypo.correct('okayyyyy')
okay
correct(str)
Shorten repeated substring until threshould without dictionary
>>> pytypo.cut_repeat('mamisaaaaaan', 1)
mamisan
>>> pytypo.cut_repeat('okayyyyy', 2)
okayy
cut_repeat(str, threshould)
Note that this method don’t use a lengthened expression normalize table (e.g., cooll -> cool).
If you want to normalize such expression, use correct() or correct_sentence() method.
License
This module is licensed under MIT License.
CHANGES
0.3 (2017-10-18)
Add many cases from Wikipedia
0.2 (2016-04-15)
Add many cases
0.1 (2016-04-14)
First release.
For personal and professional use. You cannot resell or redistribute these repositories in their original state.
There are no reviews.