maki-nage / distogram
A library to compute histograms on distributed environments, on streaming data
★23 · Updated last month
Alternatives and similar repositories for distogram:
Users interested in distogram are comparing it to the libraries listed below.
- Stream Processing Made Easy ★40 · Updated 2 years ago
- Fast and Scalable Data Structures for Scientific and Quantitative Research. ★11 · Updated 6 years ago
- 🧬 Modularised Evolutionary Algorithms For Python with Optional JIT and Multiprocessing (Ray) support. Inspired by PyTorch Lightning ★53 · Updated 2 years ago
- Simple Workflow Framework - Hamilton + APScheduler = FlowerPower ★16 · Updated this week
- A lightweight Python module for creating and running ordered graphs of computations. ★86 · Updated 2 years ago
- Cython implementation of Toolz. Please use: https://github.com/pytoolz/cytoolz ★40 · Updated 4 months ago
- Useful Mutable Mappings ★70 · Updated last year
- Derivatives models written with the Tributary data flow library ★23 · Updated 4 months ago
- Set-oriented Operations in Pandas ★24 · Updated 4 years ago
- Universal 1d/2d data containers with Transformers functionality for data analysis. ★26 · Updated 2 years ago
- Unified Distributed Execution ★51 · Updated 5 months ago
- Concurrent appendable key-value storage ★106 · Updated 9 months ago
- Python 3 library to store memory mappable objects into pickle-compatible files ★38 · Updated 6 years ago
- Function dependencies resolution and execution ★70 · Updated 4 years ago
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same… ★28 · Updated 2 years ago
- Python bindings to Succinct Data Structure Library 2.0 ★30 · Updated 5 years ago
- Datamallet is a python library which contains several helper functions and module for the common tasks in a typical data science workflow… ★11 · Updated 2 years ago
- A library to instantiate any Python object from configuration files. ★24 · Updated 2 years ago
- Vinum is a SQL processor for Python, designed for data analysis workflows and in-memory analytics. ★65 · Updated 3 years ago
- A Python package that parses sql and converts it to ibis expressions ★54 · Updated last year
- Loan Risk Prediction Neural Network and API ★17 · Updated 4 years ago
- Simple in memory data cache designed for ML applications. Built using Redis and Apache Arrow's Plasma in-memory store ★10 · Updated 4 years ago
- Dynamic Numpy arrays ★13 · Updated 8 years ago
- asyncio bridge to the duckdb library ★40 · Updated 2 years ago
- Distributed process pool for Python ★110 · Updated 2 years ago
- A pipeline framework for python ★104 · Updated last month
- Pandas Msgpack ★23 · Updated 2 years ago
- Scalable pattern search optimization with dask ★22 · Updated 8 years ago
- Distributed Task Queue based Dask ★38 · Updated last year
- A robust DAG implementation for parallel execution ★68 · Updated last year