mynameisfiber / countmemaybe
A set of distinct-value estimators that give probabilistic bounds on a set's cardinality
☆22Updated 5 years ago
Alternatives and similar repositories for countmemaybe
Users who are interested in countmemaybe are comparing it to the libraries listed below:
- Low-level primitives for collapsed Gibbs sampling in python and C++☆33Updated last year
- Simple approximate-nearest-neighbours in Python using locality sensitive hashing.☆140Updated 13 years ago
- ☆21Updated 9 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.☆30Updated 9 years ago
- Library for building reproducible data pipelines to support experimentation☆20Updated 9 years ago
- Semanticizest: dump parser and client☆20Updated 9 years ago
- Python wrapper for the Vowpal Wabbit machine learning library.☆53Updated 11 years ago
- vIPer: a new tool for IPython notebooks.☆60Updated 10 years ago
- Low-level primitives for collapsed Gibbs sampling in python and C++☆16Updated last year
- Fast Vector Operations on Pretty Big Data☆13Updated 9 years ago
- Compute association strength over semantic networks in a dimensionality-reduced form.☆32Updated 9 years ago
- mltk - Moz Language Tool Kit☆12Updated 10 years ago
- A disk-based key/value store in Python with no dependencies.☆21Updated 10 years ago
- scikit-learn addon to operate on set/"group"-based features☆41Updated 8 years ago
- Fast Dot Products on Pretty Big Data☆15Updated 6 years ago
- Infinite relational model (IRM) for datamicroscopes☆14Updated 9 years ago
- Topic modeling web application☆41Updated 9 years ago
- framework for making streamcorpus data☆11Updated 8 years ago
- TreeDict is a fast, flexible and full-featured hierarchical python container that makes simple and sophisticated bookkeeping easy.☆32Updated 9 years ago
- Data science tools from Moz☆22Updated 8 years ago
- ggplot2-inspired d3 app to make instant interactive visualizations☆55Updated 13 years ago
- Algorithms for "schema matching"☆26Updated 8 years ago
- a Simple API for RDF☆29Updated 15 years ago
- Simple spill-to-disk dictionary☆17Updated 9 years ago
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/)☆11Updated 10 years ago
- A streaming cross-cat inference engine☆20Updated last year
- A copy of the source for Grinstead and Snell's lovely probability book☆14Updated 9 years ago
- ☆10Updated 10 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for…☆20Updated 8 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013)☆55Updated 3 years ago