ascv / HyperLogLogLinks
Fast HyperLogLog for Python.
☆109Updated 3 weeks ago
Alternatives and similar repositories for HyperLogLog
Users that are interested in HyperLogLog are comparing it to the libraries listed below
Sorting:
- Hyper LogLog (native and sliding) cardinality counters☆240Updated last month
- Roaring Bitmap in Cython☆81Updated last year
- Python library for handling efficiently sorted integer sets.☆217Updated 2 months ago
- A wrapper for libhdfs3 to interact with HDFS from Python☆137Updated 4 years ago
- python implementation of the parquet columnar file format.☆354Updated 3 years ago
- Partitioned storage system based on blosc. **No longer actively maintained.**☆153Updated 8 years ago
- Python bindings for RocksDB☆151Updated 4 years ago
- HAT-Trie for Python☆86Updated 9 years ago
- Python bindings for the SQLite4 LSM database.☆131Updated 2 months ago
- C++ native client for Impala and Hive, with Python / pandas bindings☆72Updated 7 years ago
- DAFSA-based dictionary-like read-only objects for Python. Based on `dawgdic` C++ library.☆303Updated last year
- A Directed Acyclic Graph task dependency scheduler designed to simplify complex distributed pipelines☆131Updated 7 years ago
- Briefly - A Python Meta-programming Library for Job Flow Control☆106Updated 7 years ago
- Unified interface for local and distributed ndarrays☆157Updated 6 years ago
- A pure Python implementation of Apache Spark's RDD and DStream interfaces.☆271Updated last year
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 7 years ago
- Luigi Plugin for Hubot☆36Updated 9 years ago
- python client library☆10Updated 8 years ago
- A list-like type with better asymptotic performance and similar performance on small lists☆316Updated 2 years ago
- Utils around luigi.☆66Updated last month
- Python Non-cryptographic Hash Library☆287Updated 2 years ago
- Language defining a data description protocol☆185Updated 2 years ago
- Pure Python wrapper for the Hadoop WebHDFS Rest API☆52Updated 5 years ago
- ☆40Updated 8 years ago
- Python bindings for xorfilter(faster and smaller than bloom and cuckoo filters)☆117Updated 3 weeks ago
- Apache Mesos backend for Dask scheduling library☆28Updated 7 years ago
- t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed enviroments like PySpark☆401Updated 2 years ago
- Python extension for MurmurHash (MurmurHash3), a set of fast and robust hash functions.☆350Updated 2 weeks ago
- implementations of a counting bloom, a timing bloom and a scaling timing bloom... made for working with streams!☆42Updated 8 years ago
- ☆324Updated 11 months ago