mzsanford / cld
Language Detection based on Chromium's Compact Language Detector library
☆104Updated 4 years ago
Alternatives and similar repositories for cld:
Users that are interested in cld are comparing it to the libraries listed below
- Automatically exported from code.google.com/p/chromium-compact-language-detector☆160Updated 4 years ago
- Python-bindings for CityHash (http://code.google.com/p/cityhash/)☆32Updated 10 years ago
- Mirror of Apache Lucy☆99Updated 6 years ago
- An almost deterministic top k elements counter Redis module☆35Updated 5 years ago
- Jeremy's Machine Learning Library☆52Updated 8 years ago
- Python API for Various DB-Backed Simhash Clusters☆64Updated 7 years ago
- Fast Protocol Buffers module for Python☆40Updated 9 years ago
- C++ utility library☆24Updated 11 years ago
- Git mirror for the FastBit library Subversion repository.☆71Updated 8 years ago
- Simhashing in C++☆133Updated 2 years ago
- Pretty fast parser for probabilistic context free grammars☆87Updated 11 years ago
- An implementation of the MinHash algorithm in ruby using Murmur Hash☆24Updated 15 years ago
- C library for efficient string matching with Aho-Corasick☆21Updated 13 years ago
- A Hadoop toolkit for web-scale information retrieval research☆82Updated 10 years ago
- python bingding for leveldb using c api☆86Updated 11 years ago
- google all pairs similarity search package, with swig bindings☆22Updated 9 years ago
- A high performance search engine☆104Updated 8 years ago
- An inverted trigram index for accelerated string matching in Sqlite.☆77Updated 10 years ago
- trying shingling / resemblance / simhash / sketching to do some data deduping☆98Updated 9 years ago
- C++ implementation of hamming distance algorithm HmSearch using Kyoto Cabinet☆42Updated 8 years ago
- Protobufs Are Lightweight Messages☆53Updated 11 years ago
- Fast decoder for VByte-compressed integers☆122Updated 9 months ago
- Trinity IR Infrastructure☆237Updated 5 years ago
- C++11 library for network services on modern x86_64 Linux☆87Updated 9 years ago
- Big Data Made Easy☆187Updated 7 years ago
- Facebook's contrib fb303 library☆28Updated 14 years ago
- realtime pipeline processing engine☆62Updated 10 years ago
- C network daemon for HyperLogLogs☆449Updated 4 years ago
- Library implementing the storage and the query evaluation for a text search engine. It uses on a key value store database interface to st…☆47Updated 3 years ago
- A command line tool for bulk geolocation queries written in C++.☆58Updated last year