ikegami-yukino / madoka-python
Memory-efficient Count-Min Sketch Counter (based on Madoka C++ library)
☆26Updated 6 years ago
Alternatives and similar repositories for madoka-python:
Users that are interested in madoka-python are comparing it to the libraries listed below
- Online machine learning algorithms (based on OLL C++ library)☆22Updated 7 years ago
- A workflow system for Natural Language Processing.☆21Updated 5 years ago
- C++ Ternary Search Tree implementation with Python bindings☆43Updated 7 years ago
- A disk-based key/value store in Python with no dependencies.☆21Updated 9 years ago
- ☆10Updated 9 years ago
- C++ library for modeling with Pitman-Yor processes☆34Updated 7 years ago
- NYAN is a news filtering engine written in Python and some Ruby.☆15Updated last year
- A platform for storing large semantic networks on MongoDB☆22Updated 13 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.☆30Updated 9 years ago
- Hidden alignment conditional random field for classifying string pairs.☆24Updated 4 months ago
- Implementation of Bayesian Sets for fast similarity searches.☆14Updated 13 years ago
- A flexible variational inference LDA library.☆22Updated 5 years ago
- brat rapid annotation tool (brat) - for all your textual annotation needs☆10Updated 7 years ago
- USAAR participation in SemEval2015☆11Updated 2 years ago
- A pure Python implementation of Aho-Corasick algorithm.☆22Updated 6 years ago
- Infinite relational model (IRM) for datamicroscopes☆14Updated 9 years ago
- Non-Overlapping Aho-Corasick Python extension, for Python 2 (str and unicode) and Python 3☆51Updated 9 years ago
- various simple RNNs trained on synthetic grammars☆30Updated 9 years ago
- This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet…☆29Updated 2 months ago
- A collection of documents and materials for the EMNLP-2015 Semantic Similarity tutorial☆30Updated 9 years ago
- kaggle allen ai competition☆17Updated 8 years ago
- An efficient algorithm for k-bounded (Damerau-)Levenshtein distance☆16Updated 6 years ago
- Simple CORPORA list crawler☆10Updated 8 years ago
- ☆16Updated 8 years ago
- Induce word representations using random indexing (RI)☆29Updated 14 years ago
- This repo contain the exercies of the Next.ML 2015 presentation☆24Updated 10 years ago
- A project for clustering text streams using locality-sensitive hashing (LSH) in Python☆26Updated 13 years ago
- Topic Model or LDA in Cython☆21Updated 13 years ago
- A re-implementation of redpony/cdec's tokenize-anything.pl script in python☆8Updated 9 years ago
- A latent-annotated probabilistic context-free grammar (LAPCFG) parser.☆30Updated last year