hlwy-lsh 0.3.6

Creator: bradpython12

Last updated:

Add to Cart

Description:

hlwylsh 0.3.6

hlwy-lsh
LSH is a Python implementation of locality sensitive hashing with minhash. It is very useful for detecting
near duplicate documents.
The implementation uses the MurmurHash v3 library to create document finger prints.
Cython is needed if you want to regenerate the .cpp files for the hashing and shingling code. By default
the setup script uses the pregenerated .cpp sources, you can change this with the USE_CYTHON flag in
setup.py
NumPy is needed to run the code.
The MurmurHash3 library is distributed under the MIT license. More information https://github.com/aappleby/smhasher
Installation
$ pip install hlwy-lsh


✨🍰✨

License

For personal and professional use. You cannot resell or redistribute these repositories in their original state.

Customer Reviews

There are no reviews.