maki-nage / distogram
A library to compute histograms on distributed environments, on streaming data
☆23 · Updated 2 years ago
Related projects:
- 🧬 Modularised Evolutionary Algorithms For Python with Optional JIT and Multiprocessing (Ray) support. Inspired by PyTorch Lightning ☆52 · Updated last year
- Stream Processing Made Easy ☆38 · Updated 2 years ago
- Cython implementation of Toolz. Please use: https://github.com/pytoolz/cytoolz ☆39 · Updated 7 months ago
- Cross Thread Message Pipe ☆18 · Updated 4 years ago
- Fast and Scalable Data Structures for Scientific and Quantitative Research. ☆12 · Updated 5 years ago
- A lightweight Python module for creating and running ordered graphs of computations. ☆84 · Updated last year
- Convenient pyarrow operations following the Pandas API ☆43 · Updated 2 years ago
- Notebooks, slides, and examples for "Streaming, cross-sectional data visualization in Jupyterlab with Perspective and Apache Arrow", my J… ☆26 · Updated 3 years ago
- Simple in-memory data cache designed for ML applications. Built using Redis and Apache Arrow's Plasma in-memory store ☆9 · Updated 3 years ago
- Quickly move data from postgres to numpy or pandas. ☆63 · Updated last year
- Streaming API for pandas applied to big datasets ☆29 · Updated this week
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same… ☆28 · Updated last year
- Python driver for Timeplus Enterprise or Timeplus Proton ☆11 · Updated last month
- (no description) ☆29 · Updated this week
- Function dependencies resolution and execution ☆71 · Updated 4 years ago
- Set-oriented Operations in Pandas ☆24 · Updated 4 years ago
- asyncio bridge to the duckdb library ☆31 · Updated last year
- Dataflow based workflow framework ☆41 · Updated 3 years ago
- Vinum is a SQL processor for Python, designed for data analysis workflows and in-memory analytics. ☆65 · Updated 3 years ago
- (no description) ☆21 · Updated 4 years ago
- Python bindings for xorfilter (faster and smaller than Bloom and cuckoo filters) ☆111 · Updated 2 weeks ago
- Python bindings for CityHash ☆10 · Updated 11 months ago
- (no description) ☆38 · Updated 3 months ago
- Unified Distributed Execution ☆46 · Updated last week
- Derivatives models written with the Tributary data flow library ☆19 · Updated 7 months ago
- Run-length encoded arrays for pandas. ☆21 · Updated last year
- Process, visualize and use data easily. ☆20 · Updated last year
- Simple, lightweight, extensible DAG framework for Python with a Kubeflow-like API ☆63 · Updated 6 months ago
- Python DataFrame with fast insert and appends ☆74 · Updated last year
- A powerful data analysis package based on mathematical step functions. Strongly aligned with pandas. ☆59 · Updated 3 weeks ago