svpcom / hyperloglogLinks
Python Implementation of Hyper LogLog and Sliding Hyper LogLog algorithms
☆236Updated 2 months ago
Alternatives and similar repositories for hyperloglog
Users that are interested in hyperloglog are comparing it to the libraries listed below
Sorting:
- Fast HyperLogLog for Python.☆108Updated 2 weeks ago
- Python library for handling efficiently sorted integer sets.☆215Updated last month
- python implementation of the parquet columnar file format.☆353Updated 3 years ago
- Python Non-cryptographic Hash Library☆283Updated last year
- Battle-tested Apache Storm Multi-Lang implementation for Python☆70Updated 2 weeks ago
- t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed enviroments like PySpark☆399Updated 2 years ago
- A list-like type with better asymptotic performance and similar performance on small lists☆316Updated 2 years ago
- A wrapper for libhdfs3 to interact with HDFS from Python☆137Updated 4 years ago
- Roaring Bitmap in Cython☆81Updated last year
- An in-memory LRU cache for python☆154Updated 4 years ago
- Python bindings for the snappy google library☆484Updated 9 months ago
- Pure Python wrapper for the Hadoop WebHDFS Rest API☆52Updated 4 years ago
- Python Low-Overhead Profiler☆919Updated 7 months ago
- Python bindings for RocksDB☆151Updated 3 years ago
- Briefly - A Python Meta-programming Library for Job Flow Control☆106Updated 6 years ago
- Application level metrics collector library☆60Updated 9 years ago
- Full featured consistent hashing python library compatible with ketama☆211Updated 4 months ago
- Apache Mesos backend for Dask scheduling library☆28Updated 7 years ago
- Python extension for MurmurHash (MurmurHash3), a set of fast and robust hash functions.☆344Updated this week
- Performance metrics, based on Coda Hale's Yammer metrics☆196Updated 2 years ago
- A pure Python implementation of Apache Spark's RDD and DStream interfaces.☆270Updated 11 months ago
- A pure python HDFS client☆857Updated 3 years ago
- ☆322Updated 10 months ago
- PySpark Cassandra brings back the fun in working with Cassandra data in PySpark.☆79Updated 8 years ago
- Asynchronous replication framework for distributed Python projects☆353Updated 2 years ago
- Redis in a python module.☆591Updated 11 months ago
- A Python MapReduce and HDFS API for Hadoop☆240Updated 6 months ago
- Fast Python Bloom Filter using Mmap☆744Updated 5 years ago
- disque python client☆78Updated 6 years ago
- DAFSA-based dictionary-like read-only objects for Python. Based on `dawgdic` C++ library.☆305Updated last year