maki-nage / distogramLinks
A library to compute histograms on distributed environments, on streaming data
β23Updated 4 months ago
Alternatives and similar repositories for distogram
Users that are interested in distogram are comparing it to the libraries listed below
Sorting:
- 𧬠Modularised Evolutionary Algorithms For Python with Optional JIT and Multiprocessing (Ray) support. Inspired by PyTorch Lightningβ53Updated 2 years ago
- Dataflow based workflow frameworkβ41Updated 4 years ago
- Stream Processing Made Easyβ42Updated 3 years ago
- Function dependencies resolution and executionβ70Updated 5 years ago
- A lightweight Python module for creating and running ordered graphs of computations.β86Updated 2 years ago
- Unified Distributed Executionβ54Updated 8 months ago
- Yet another easy-to-use python3 parallel library for humans.β13Updated 4 years ago
- Vinum is a SQL processor for Python, designed for data analysis workflows and in-memory analytics.β65Updated 4 years ago
- Magniv Core - A Python-decorator based job orchestration platform. Avoid responsibility handoffs by abstracting infra and DevOps.β79Updated last year
- Convenient pyarrow operations following the Pandas APIβ44Updated 3 years ago
- A library to instantiate any Python object from configuration files.β24Updated 2 years ago
- Datamallet is a python library which contains several helper functions and module for the common tasks in a typical data science workflowβ¦β11Updated 3 years ago
- Toolkit for graph-relational data across space and timeβ115Updated 10 months ago
- Python library for declarative, constrained, structured-output prediction.β21Updated last year
- Fast and Scalable Data Structures for Scientific and Quantitative Research.β11Updated 6 years ago
- Process, visualize and use data easily.β20Updated 2 years ago
- β21Updated 5 years ago
- β41Updated 2 months ago
- Support for jupyter notebook templates in jupyterlabβ25Updated 3 months ago
- Set-oriented Operations in Pandasβ24Updated 5 years ago
- Distributed persistent Task Queue running on Daskβ38Updated 2 years ago
- Python driver for Timeplus Enterprise or Timeplus Protonβ14Updated 7 months ago
- Convert monolithic Jupyter notebooks π into maintainable Ploomber pipelines. πβ79Updated 9 months ago
- Streaming API for pandas applied to big datasetsβ31Updated 10 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withouβ¦β113Updated last year
- plait.py - a fake data modelerβ435Updated 6 years ago
- π― aimrocks πΈ β python & cython bindings for RocksDB. Batteries included! πβ32Updated 4 months ago
- This repository is no longer maintained.β15Updated 3 years ago
- Data pipelines from re-usable componentsβ108Updated 2 years ago
- [ARCHIVED] Dask support for multi-GPU machine learning algorithms --> Moved to cumlβ16Updated 5 years ago