commonsmachinery / blockhash-pythonLinks
Implementation of perceptual image hash calculation in Python
☆132Updated 2 years ago
Alternatives and similar repositories for blockhash-python
Users that are interested in blockhash-python are comparing it to the libraries listed below
Sorting:
- A Python Perceptual Image Hashing Module☆215Updated 3 years ago
- Levenshtein and Hamming distance computation☆116Updated 6 years ago
- Python library to calculate the difference hash (perceptual hash) for a given image, useful for detecting duplicates☆373Updated last year
- Fast multi-keyword search engine for text strings☆257Updated last year
- A fast Python implementation of locality sensitive hashing.☆70Updated 10 years ago
- Locality-sensitive hashing algorithm for text similarity comparisons☆58Updated 7 months ago
- URL normalization for Python☆99Updated 6 months ago
- A simple fuzzy matching set for python strings☆230Updated last year
- 💥 Cython hash tables that assume keys are pre-hashed☆87Updated this week
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi…☆48Updated 3 years ago
- Naïve Bayesian Text Classifier on Redis☆116Updated 6 years ago
- C++ Ternary Search Tree implementation with Python bindings☆43Updated 7 years ago
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance.☆250Updated last month
- Python BK-tree data structure to allow fast querying of "close" matches☆186Updated 4 years ago
- Fast, (mostly) lossless JPEG transformations with Python☆147Updated 2 years ago
- A python interface to djb's cdb library☆68Updated 4 years ago
- LZ4Frame Bindings and tools for Python☆90Updated 3 years ago
- A Python implementation of the Double Metaphone algorithm☆61Updated 15 years ago
- Weighted Levenshtein library☆113Updated 2 years ago
- A component that tries to avoid downloading duplicate content☆27Updated 7 years ago
- Formasaurus tells you the type of an HTML form and its fields using machine learning☆119Updated last year
- Python library for reading and writing warc files☆245Updated 3 years ago
- A pure python implementation of locality sensitive hashing for text documents☆85Updated 10 years ago
- Python library for extracting text from various file formats (for indexing).☆113Updated 3 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆157Updated 2 months ago
- Tool to visualize data quickly with no brain usage for plot creation☆46Updated 3 weeks ago
- extract difference between two html pages☆32Updated 7 years ago
- Tool for managing data-deduplication within extant compressed archive files, along with a relatively performant BK tree implementation fo…☆106Updated 2 years ago
- DAFSA-based dictionary-like read-only objects for Python. Based on `dawgdic` C++ library.☆303Updated last year
- Show summary of a large number of URLs in a Jupyter Notebook☆17Updated 4 years ago