vilda / shashLinks
Similarity hashing
β49Updated 14 years ago
Alternatives and similar repositories for shash
Users that are interested in shash are comparing it to the libraries listed below
Sorting:
- A high performance search engineβ106Updated 8 years ago
- π SQLite extension to add the Okapi BM25 ranking algorithmβ35Updated 10 years ago
- Simhashing in C++β135Updated 2 years ago
- A tool for semantic relation extraction. The program finds pairs of semantically related words based on the text definitions coming from β¦β26Updated 11 years ago
- Recursively scans HTML pages for URLs and downloads desired content.β12Updated 9 years ago
- A simple bloom filter for SQLite using Murmur3β18Updated 14 years ago
- A light weight, low level embedded key-value database libraryβ32Updated 12 years ago
- A crawler, indexer, and query interface all in Python with distributed processing via Pyro4.β23Updated 13 years ago
- Clone version of LingPipe 4.1.0, with support for unsupervised trainingβ32Updated 12 years ago
- A simple and fast search engineβ70Updated 3 years ago
- A comparison between different integer set techniquesβ14Updated 7 years ago
- Wrapper to pocketsphinx phoneme labeling toolsβ18Updated 9 years ago
- Metric tree demoβ14Updated 11 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json dump. Questions? https://gitter.im/idio-opensource/Lobbyβ17Updated 3 years ago
- An almost deterministic top k elements counter Redis moduleβ35Updated 6 years ago
- Suite of tools for detecting changes in web pages and their renderingβ55Updated 2 years ago
- Minimal pub/sub message queue in C.β23Updated 11 years ago
- A simple proof of concept levenshtein automaton in Pythonβ108Updated 10 years ago
- Search Formula-1ββA distributed high performance massive data engine for enterprise/vertical searchβ169Updated 10 years ago
- Stanford Tregex-inspired language for rule-based dependency tree manipulation.β21Updated 8 years ago
- SymSpell C++ Portsβ31Updated 7 years ago
- iCQA - Intelligent Community Question Answering Frameworkβ31Updated 9 years ago
- A vector similarity databaseβ230Updated 11 years ago
- A flexible implementation of enhanced suffix arrays in template based C++. Supports single and multi-position wildcard. Fast queries thanβ¦β21Updated 5 years ago
- Extract meaningful content from pdf and psd file, such as texts and images both linked into a common JSON stringβ36Updated 7 years ago
- A Library for Spherical Geometryβ57Updated 8 years ago
- A redis-protocol compatible frontend to google's leveldbβ204Updated last year
- epoll demoβ13Updated 8 years ago
- ζηθΎε ₯ζ³η»θθ―εΊθ§£ζβ17Updated 12 years ago
- Library implementing the storage and the query evaluation for a text search engine. It uses on a key value store database interface to stβ¦β47Updated 4 years ago