python-streamz / streamz
Real-time stream processing for python
☆1,244Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for streamz
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆623Updated last week
- python implementation of the parquet columnar file format.☆787Updated last week
- Intake is a lightweight package for finding, investigating, loading and disseminating data.☆1,013Updated last week
- Extended pickling support for Python objects☆1,661Updated last month
- Python library for building highly effective data science workflows☆952Updated last year
- A distributed task scheduler for Dask☆1,579Updated this week
- Easy pipelines for pandas DataFrames.☆716Updated 3 weeks ago
- Scalable Machine Learning with Dask☆902Updated 3 months ago
- Concurrent data pipelines in Python >>>☆1,549Updated last year
- Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark☆1,481Updated this week
- A Python library for unevenly-spaced time series analysis☆530Updated 2 months ago
- Fast Avro for Python☆645Updated this week
- Numba extension for compiling Pandas data frames, Intel® Scalable Dataframe Compiler☆646Updated last year
- Clean APIs for data cleaning. Python implementation of R package Janitor☆1,364Updated this week
- Fast NumPy array functions written in C☆1,073Updated last month
- Quilt is a data mesh for connecting people with actionable data☆1,330Updated this week
- Streaming reactive and dataflow graphs in Python☆442Updated this week
- Airspeed Velocity: A simple Python benchmarking tool with web-based reporting☆875Updated 2 months ago
- Immutable and statically-typeable DataFrames with runtime type and data validation☆442Updated this week
- Computing with Python functions.☆3,880Updated last week
- bamboolib - a GUI for pandas DataFrames☆939Updated 9 months ago
- A library for defensive data analysis.☆501Updated 4 years ago
- Cython implementation of Toolz: High performance functional utilities☆1,009Updated 2 weeks ago
- A specification that python filesystems should adhere to.☆1,037Updated this week
- A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow☆2,081Updated 11 months ago
- Data Migration for the Blaze Project☆1,004Updated 2 years ago
- Tuplex is a parallel big data processing framework that runs data science pipelines written in Python at the speed of compiled code. Tupl…☆810Updated 7 months ago
- A light-weight, flexible, and expressive statistical data testing library☆3,401Updated this week
- Describing statistical models in Python using symbolic formulas☆954Updated this week
- A Python package for manipulating 2-dimensional tabular data structures☆1,818Updated 3 weeks ago