mynameisfiber / countmemaybe
A set of distinct value estimators that give probabilistic bounds on a sets cardinality
☆22Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for countmemaybe
- Low-level primitives for collapsed Gibbs sampling in python and C++☆33Updated 7 months ago
- ☆21Updated 8 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆34Updated 8 years ago
- Compute association strength over semantic networks in a dimensionality-reduced form.☆33Updated 9 years ago
- scikit-learn addon to operate on set/"group"-based features☆41Updated 8 years ago
- Python wrapper for the Vowpal Wabbit machine learning library.☆53Updated 11 years ago
- Topic modeling web application☆39Updated 9 years ago
- Infinite relational model (IRM) for datamicroscopes☆14Updated 9 years ago
- Semanticizest: dump parser and client☆20Updated 8 years ago
- Memory-efficient Count-Min Sketch Counter (based on Madoka C++ library)☆25Updated 5 years ago
- Low-level primitives for collapsed Gibbs sampling in python and C++☆16Updated 7 months ago
- A Python library for learning from dimensionality reduction, supporting sparse and dense matrices.☆78Updated 7 years ago
- mltk - Moz Language Tool Kit☆12Updated 9 years ago
- A disk-based key/value store in Python with no dependencies.☆21Updated 9 years ago
- SmallK: very fast data clustering tools☆14Updated 5 years ago
- C++ Ternary Search Tree implementation with Python bindings☆43Updated 6 years ago
- a Simple API for RDF☆29Updated 15 years ago
- Fast structured perceptron sequential labeler☆15Updated 8 years ago
- Data science tools from Moz☆22Updated 7 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.☆30Updated 9 years ago
- workflow support for reproducible deduplication and merging☆16Updated last year
- A Bayesian testing framework written in Python.☆95Updated 9 years ago
- This repo contain the exercies of the Next.ML 2015 presentation☆24Updated 9 years ago
- a latex cheat sheet with ipython commands and shortcuts☆10Updated 10 years ago
- hacky exploratory variants on NN language models☆9Updated 9 years ago
- MetroMaps Release☆16Updated 10 years ago
- Demo code for learning_text_transformer☆25Updated 9 years ago
- ☆28Updated 8 years ago
- Latent dirichlet allocation (LDA) for datamicroscopes☆39Updated 9 years ago