mynameisfiber / countmemaybe
A set of distinct value estimators that give probabilistic bounds on a sets cardinality
☆22Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for countmemaybe
- Low-level primitives for collapsed Gibbs sampling in python and C++☆33Updated 7 months ago
- scikit-learn addon to operate on set/"group"-based features☆41Updated 8 years ago
- Simulations for my blog post, as well as some helper functions for R users and Python users.☆18Updated 7 years ago
- ☆21Updated 8 years ago
- A Python library for learning from dimensionality reduction, supporting sparse and dense matrices.☆78Updated 7 years ago
- ☆10Updated 9 years ago
- Demo code for learning_text_transformer☆25Updated 9 years ago
- Compute association strength over semantic networks in a dimensionality-reduced form.☆33Updated 9 years ago
- Infinite relational model (IRM) for datamicroscopes☆14Updated 9 years ago
- vIPer: a new tool for IPython notebooks.☆60Updated 9 years ago
- Low-level primitives for collapsed Gibbs sampling in python and C++☆16Updated 7 months ago
- Data science tools from Moz☆22Updated 7 years ago
- workflow support for reproducible deduplication and merging☆16Updated last year
- ggplot2-inspired d3 app to make instant interactive visualizations☆55Updated 12 years ago
- Python wrapper for the Vowpal Wabbit machine learning library.☆53Updated 11 years ago
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/)☆11Updated 9 years ago
- Machine Learning Open Source Software☆23Updated 6 years ago
- ☆22Updated 9 years ago
- ☆19Updated 5 years ago
- Fast Vector Operations on Pretty Big Data☆13Updated 9 years ago
- A helper repository for converting Jupyter notebooks into a wordpress-friendly format☆12Updated 7 years ago
- Fast Dot Products on Pretty Big Data☆15Updated 6 years ago
- Topic modeling web application☆39Updated 9 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013)☆55Updated 2 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆34Updated 8 years ago
- Automated NLP sentiment predictions- batteries included, or use your own data☆18Updated 6 years ago
- mltk - Moz Language Tool Kit☆12Updated 9 years ago