fastforwardlabs / cuckoofilter
☆40Updated 7 years ago
Alternatives and similar repositories for cuckoofilter:
Users who are interested in cuckoofilter are comparing it to the libraries listed below.
- implementations of a counting bloom, a timing bloom and a scaling timing bloom... made for working with streams!☆42Updated 7 years ago
- Python forecasting and smoothing library☆67Updated 5 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆51Updated 7 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013)☆55Updated 3 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 6 years ago
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆62Updated 8 years ago
- Partitioned storage system based on blosc. **No longer actively maintained.**☆152Updated 8 years ago
- ☆34Updated 8 years ago
- Python wrapper of xxhash☆34Updated 10 years ago
- Interactive performance benchmarking in Jupyter☆33Updated last month
- A Python library for dealing with splittable files☆42Updated 5 years ago
- Tool to visualize data quickly with no brain usage for plot creation☆46Updated 5 years ago
- Apache Mesos backend for Dask scheduling library☆28Updated 7 years ago
- HAT-Trie for Python☆86Updated 8 years ago
- MapReduce platform in python☆34Updated 9 years ago
- Simple approximate-nearest-neighbours in Python using locality sensitive hashing.☆140Updated 12 years ago
- Fetch and plot AWS spot pricing history☆23Updated 8 years ago
- Simple spill-to-disk dictionary☆60Updated 3 years ago
- ggplot2-inspired d3 app to make instant interactive visualizations☆55Updated 12 years ago
- S3-backed notebook manager for IPython☆29Updated 7 years ago
- Notes on Lambda Architecture☆12Updated 6 years ago
- spy on your random forests☆19Updated 4 years ago
- Creates models to classify documents into categories☆66Updated 7 years ago
- zero_buffer is a high-performance, zero-copy implementation of a byte-buffer for Python.☆134Updated 7 years ago
- workflow support for reproducible deduplication and merging☆16Updated last year
- Autoencoders to find structure in arbitrary datasets☆123Updated 9 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyData Paris 2015☆50Updated 9 years ago
- ☆52Updated 8 years ago
- A fast Python implementation of locality sensitive hashing.☆70Updated 9 years ago
- Streaming estimation of percentiles, especially high percentiles.☆63Updated 12 years ago