gakhov / pdsaLinks
Probabilistic Data Structures and Algorithms in Python
☆131Updated 5 years ago
Alternatives and similar repositories for pdsa
Users that are interested in pdsa are comparing it to the libraries listed below
Sorting:
- Python bindings for xorfilter(faster and smaller than bloom and cuckoo filters)☆119Updated 3 months ago
- Core C++ Sketch Library☆247Updated last month
- FlorDB 🌻☆156Updated 2 months ago
- Keyvi - the key value index. It is an in-memory FST-based data structure highly optimized for size and lookup performance.☆254Updated last week
- Probabilistic data structures in python http://pyprobables.readthedocs.io/en/latest/index.html☆122Updated 3 weeks ago
- Lambda Learner is a library for iterative incremental training of a class of supervised machine learning models.☆41Updated 2 years ago
- Python implementations of the distributed quantile sketch algorithm DDSketch☆89Updated 7 months ago
- Sketching linear classifiers over data streams with the Weight-Median Sketch (SIGMOD 2018).☆39Updated 7 years ago
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.☆301Updated last year
- Parameterless and Universal FInding of Nearest Neighbors☆59Updated 9 months ago
- A Scalable Auto-ML System☆55Updated 2 years ago
- Python bindings to Succinct Data Structure Library 2.0☆34Updated 6 years ago
- Distribution transparent Machine Learning experiments on Apache Spark☆91Updated last year
- A polystore database from researchers of the Intel Science and Technology Center for Big Data☆39Updated 3 years ago
- Embedded MonetDB with a Python frontend and fast Numpy/Pandas support☆64Updated last year
- PostgreSQL extension providing approximate algorithms based on apache/datasketches-cpp☆90Updated 5 months ago
- Weighted MinHash implementation on CUDA (multi-gpu).☆122Updated 2 years ago
- an anagram☆136Updated 4 years ago
- Fast HyperLogLog for Python.☆110Updated 3 months ago
- Website for DataSketches.☆107Updated 2 weeks ago
- LSH index for approximate set containment search☆61Updated 3 years ago
- Apache datasketches☆38Updated 4 months ago
- Dremio Flight connector. Access Dremio using Arrow flight☆39Updated 5 years ago
- Ray-based Apache Beam runner☆42Updated 2 years ago
- Python bindings for the fast integer compression library FastPFor.☆61Updated 2 years ago
- A collection of libraries for single-pass, distributed, sublinear-space approximate aggregation and sketching algorithms. Currently: Hype…☆164Updated 7 months ago
- Willump Is a Low-Latency Useful Machine learning Platform.☆45Updated 2 years ago
- Search for similar short strings☆53Updated 5 years ago
- Red/black tree with support for fast accumulation of values in a key range☆18Updated last year
- Implements the Karnin-Lang-Liberty (KLL) algorithm in python☆58Updated 3 years ago