vilda / shashLinks
Similarity hashing
β49Updated 14 years ago
Alternatives and similar repositories for shash
Users that are interested in shash are comparing it to the libraries listed below
Sorting:
- π SQLite extension to add the Okapi BM25 ranking algorithmβ35Updated 10 years ago
- High Performance Marmotta Backend implementation in C++ (using gRPC and LevelDB)β16Updated 9 years ago
- N-grams approximate string matching implementation in pure Pythonβ26Updated 14 years ago
- Contains the implementation of algorithms that estimate the geographic location of media content based on their content and metadata. It β¦β15Updated 8 years ago
- Reduced on-disk Suffix Arrayβ22Updated 11 years ago
- epoll demoβ13Updated 7 years ago
- memcachedb ported from BerkeleyDB to LMDB originally from http://memcachedb.googlecode.com/svn/trunkβ87Updated 10 years ago
- A simple bloom filter for SQLite using Murmur3β18Updated 13 years ago
- Library implementing the storage and the query evaluation for a text search engine. It uses on a key value store database interface to stβ¦β47Updated 3 years ago
- Super efficient TCP connection between remote processesβ12Updated 9 years ago
- Collects multimedia content shared through social networks.β19Updated 10 years ago
- Recursively scans HTML pages for URLs and downloads desired content.β12Updated 8 years ago
- A crawler, indexer, and query interface all in Python with distributed processing via Pyro4.β23Updated 13 years ago
- C language port of google-diff-match-patch libraryβ41Updated 9 years ago
- A tool for semantic relation extraction. The program finds pairs of semantically related words based on the text definitions coming from β¦β26Updated 11 years ago
- Argos is a structural data search engineβ41Updated 11 years ago
- A pure C implementation of the Geohash algorithm.β106Updated 6 years ago
- A light weight, low level embedded key-value database libraryβ32Updated 12 years ago
- A framework for building reranking models.β28Updated 10 years ago
- pythonic filesystem libraryβ35Updated 13 years ago
- A simple and fast search engineβ70Updated 3 years ago
- Discussion Summarization is the process of condensing a text document which is a collection of discussion threads, using CBS (Cluster Basβ¦β12Updated 11 years ago
- a tiny nosql database supporting pluggable storage engine.β40Updated 7 years ago
- Mirror of Apache Lucyβ99Updated 7 years ago
- natural language processing with link-grammarβ18Updated 15 years ago
- A Java library capable of constructing character-sequence-storing, directed acyclic graphs of minimal sizeβ43Updated 12 years ago
- google all pairs similarity search package, with swig bindingsβ22Updated 10 years ago
- Focused Crawler for VT's CTRNetβ10Updated 12 years ago
- LMAX's disruptor pattern implemented in Cβ96Updated last month
- Distributed search engine in C++14, using nanomsg for communication, bond for serialization.β67Updated 10 years ago