ascv / HyperLogLog
Fast HyperLogLog for Python.
☆110 Updated 2 months ago
Alternatives and similar repositories for HyperLogLog
Users that are interested in HyperLogLog are comparing it to the libraries listed below
- Hyper LogLog (native and sliding) cardinality counters ☆240 Updated 2 months ago
- Roaring Bitmap in Cython ☆81 Updated last year
- Python library for handling efficiently sorted integer sets. ☆225 Updated last month
- DAFSA-based dictionary-like read-only objects for Python. Based on `dawgdic` C++ library. ☆303 Updated last year
- Unified interface for local and distributed ndarrays ☆157 Updated 7 years ago
- python implementation of the parquet columnar file format. ☆355 Updated 4 years ago
- A fast Python implementation of locality sensitive hashing. ☆70 Updated 10 years ago
- A wrapper for libhdfs3 to interact with HDFS from Python ☆137 Updated 4 years ago
- Python bindings for RocksDB ☆150 Updated 4 years ago
- A pure Python implementation of Apache Spark's RDD and DStream interfaces. ☆271 Updated last year
- ☆40 Updated 8 years ago
- Python Non-cryptographic Hash Library ☆287 Updated 2 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible. ☆52 Updated 8 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013) ☆55 Updated 3 years ago
- Partitioned storage system based on blosc. **No longer actively maintained.** ☆154 Updated 8 years ago
- Python bindings for the SQLite4 LSM database. ☆132 Updated 4 months ago
- Concurrent appendable key-value storage ☆107 Updated last year
- Python bindings for FarmHash and CityHash ☆46 Updated last month
- Python Driver for Apache Drill. ☆61 Updated 2 years ago
- Briefly - A Python Meta-programming Library for Job Flow Control ☆106 Updated 7 years ago
- Python extension for MurmurHash (MurmurHash3), a set of fast and robust hash functions. ☆352 Updated last week
- A list-like type with better asymptotic performance and similar performance on small lists ☆317 Updated 2 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead ☆52 Updated 7 years ago
- 💥 Cython hash tables that assume keys are pre-hashed ☆86 Updated 5 months ago
- persistent caching to memory, disk, or database ☆276 Updated last week
- Python library to infer date format from examples ☆45 Updated 3 years ago
- Caching based on computation time and storage space ☆138 Updated 4 years ago
- A Directed Acyclic Graph task dependency scheduler designed to simplify complex distributed pipelines ☆131 Updated 7 years ago
- t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed environments like PySpark ☆402 Updated 2 years ago
- Luigi Plugin for Hubot ☆36 Updated 9 years ago