A cluster implementation of simhash near-duplicate detection
☆32Mar 11, 2015Updated 11 years ago
Alternatives and similar repositories for simhash-cluster
Users that are interested in simhash-cluster are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Gevent Crawling in Python, with Utilities☆22Mar 12, 2015Updated 11 years ago
- Python API for Various DB-Backed Simhash Clusters☆64Mar 16, 2017Updated 9 years ago
- This is a repository in which we take part in the big data competition, focusing on recommendation system.☆17May 24, 2016Updated 10 years ago
- ☆14Aug 24, 2021Updated 4 years ago
- TreeDict is a fast, flexible and full-featured hierarchical python container that makes simple and sophisticated bookkeeping easy.☆33Apr 14, 2016Updated 10 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- a minimum demo web framework based on servlet☆10Sep 3, 2015Updated 10 years ago
- Parser for KAF NAF files written in Python☆16Jul 1, 2021Updated 4 years ago
- 常用配置和工具☆29Sep 11, 2024Updated last year