mynameisfiber / countmemaybeLinks
A set of distinct value estimators that give probabilistic bounds on a sets cardinality
☆22Updated 5 years ago
Alternatives and similar repositories for countmemaybe
Users that are interested in countmemaybe are comparing it to the libraries listed below
Sorting:
- ☆21Updated 9 years ago
- Low-level primitives for collapsed Gibbs sampling in python and C++☆33Updated last year
- Simple approximate-nearest-neighbours in Python using locality sensitive hashing.☆140Updated 13 years ago
- mltk - Moz Language Tool Kit☆12Updated 10 years ago
- C++ Ternary Search Tree implementation with Python bindings☆43Updated 7 years ago
- A Python library for learning from dimensionality reduction, supporting sparse and dense matrices.☆78Updated 8 years ago
- A streaming cross-cat inference engine☆20Updated last year
- implementations of a counting bloom, a timing bloom and a scaling timing bloom... made for working with streams!☆42Updated 8 years ago
- Low-level primitives for collapsed Gibbs sampling in python and C++☆16Updated last year
- Compute association strength over semantic networks in a dimensionality-reduced form.☆32Updated 10 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆35Updated 8 years ago
- scikit-learn addon to operate on set/"group"-based features☆41Updated 9 years ago
- Simple spill-to-disk dictionary☆61Updated 3 years ago
- Topic modeling web application☆40Updated 10 years ago
- Latent dirichlet allocation (LDA) for datamicroscopes☆41Updated 9 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for…☆20Updated 8 years ago
- A framework (comand line tool + libraries) for creating flexible compute pipelines☆56Updated 4 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.☆30Updated 10 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆52Updated 8 years ago
- a Simple API for RDF☆29Updated 15 years ago
- Website for standardized execution and evaluation of algorithms on datasets.☆35Updated 5 years ago
- Library for GPU-related statistical functions☆84Updated 12 years ago
- Latent Dirichlet Allocation for topic modeling of streamed data sources☆100Updated 10 years ago
- A disk-based key/value store in Python with no dependencies.☆21Updated 10 years ago
- A Topic Modeling toolbox☆92Updated 9 years ago
- Deploying tmpnb nodes☆18Updated 7 years ago
- Topic Model or LDA in Cython☆21Updated 14 years ago
- Simple spill-to-disk dictionary☆18Updated 9 years ago
- Semanticizest: dump parser and client☆20Updated 9 years ago
- A project to demonstrate maximum entropy models for extracting quotes from news articles in Python.☆25Updated 13 years ago