mynameisfiber / countmemaybeLinks
A set of distinct value estimators that give probabilistic bounds on a sets cardinality
☆22Updated 5 years ago
Alternatives and similar repositories for countmemaybe
Users that are interested in countmemaybe are comparing it to the libraries listed below
Sorting:
- ☆21Updated 9 years ago
- Low-level primitives for collapsed Gibbs sampling in python and C++☆33Updated last year
- mltk - Moz Language Tool Kit☆12Updated 10 years ago
- Latent Dirichlet Allocation for topic modeling of streamed data sources☆100Updated 10 years ago
- C++ Ternary Search Tree implementation with Python bindings☆43Updated 7 years ago
- Simple approximate-nearest-neighbours in Python using locality sensitive hashing.☆141Updated 13 years ago
- Topic modeling web application☆40Updated 10 years ago
- Low-level primitives for collapsed Gibbs sampling in python and C++☆16Updated last year
- implementations of a counting bloom, a timing bloom and a scaling timing bloom... made for working with streams!☆42Updated 8 years ago
- A framework (comand line tool + libraries) for creating flexible compute pipelines☆56Updated 4 years ago
- Compute association strength over semantic networks in a dimensionality-reduced form.☆32Updated 10 years ago
- Latent dirichlet allocation (LDA) for datamicroscopes☆41Updated 10 years ago
- Simple spill-to-disk dictionary☆61Updated 3 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆52Updated 8 years ago
- A Python library for learning from dimensionality reduction, supporting sparse and dense matrices.☆78Updated 8 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆35Updated 9 years ago
- A Topic Modeling toolbox☆92Updated 9 years ago
- A streaming cross-cat inference engine☆20Updated last year
- scikit-learn addon to operate on set/"group"-based features☆41Updated 9 years ago
- Dependency computation project for maven/java and pip/python examples☆63Updated 12 years ago
- Fast Dot Products on Pretty Big Data☆15Updated 7 years ago
- Python forecasting and smoothing library☆67Updated 6 years ago
- Library for GPU-related statistical functions☆84Updated 13 years ago
- ggplot2-inspired d3 app to make instant interactive visualizations☆55Updated 13 years ago
- Randomly sample lines from a csv, tsv, or other line-based data file☆125Updated 10 years ago
- The Zero Effort Network Library for Python☆67Updated 7 years ago
- A PyData 2013 talk on straightforward, data-driven ways to handle natural language text in Python.☆51Updated 11 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.☆30Updated 10 years ago
- Partitioned storage system based on blosc. **No longer actively maintained.**☆154Updated 9 years ago
- Autoencoders to find structure in arbitrary datasets☆122Updated 10 years ago