ikucan / pykafarr
A high-performance Python Kafka client. Efficiently from Kafka to Pandas and back.
☆40 Updated 6 years ago
Alternatives and similar repositories for pykafarr
Users who are interested in pykafarr are comparing it to the libraries listed below.
- A Python implementation of Apache Kafka Streams ☆311 Updated 6 years ago
- Derivatives models written with the Tributary data flow library ☆23 Updated 3 weeks ago
- Stream Processing Made Easy ☆41 Updated 3 years ago
- A Cookiecutter template for creating Faust projects quickly. ☆70 Updated 2 years ago
- A consistent table management library in Python ☆159 Updated 2 years ago
- The sane way of building a data layer in Airflow ☆24 Updated 5 years ago
- Helpers & syntactic sugar for PySpark. ☆62 Updated last year
- A Python wrapper for the KSQL REST API. ☆159 Updated last year
- SQL on dataframes - pandas and Dask ☆64 Updated 7 years ago
- ETLy is an add-on dashboard service on top of Apache Airflow. ☆68 Updated last year
- A Kafka Streams client library built on confluent-kafka-python ☆67 Updated last year
- Deploy Dask on YARN clusters ☆69 Updated 9 months ago
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver. ☆24 Updated last year
- A Python framework for data processing on GCP. ☆118 Updated last month
- Jupyter extensions for SWAN ☆58 Updated last month
- Dremio Flight connector. Access Dremio using Arrow Flight ☆40 Updated 4 years ago
- Dockerized setup for testing code on realistic Hadoop clusters ☆27 Updated 4 years ago
- Sample code to accompany blog post showcasing Arrow Flight SQL running on DuckDB ☆33 Updated 2 years ago
- Python stream processing for analytics ☆38 Updated last month
- Distributed event processing for Python based on Redis Streams ☆133 Updated 4 years ago
- DB API 2 interface for Flight SQL with SQLAlchemy extras. ☆39 Updated last month
- Python DataFrame with fast insert and appends ☆75 Updated last month
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters. ☆70 Updated 3 years ago
- Vinum is a SQL processor for Python, designed for data analysis workflows and in-memory analytics. ☆65 Updated 4 years ago
- A tool and library for easily deploying applications on Apache YARN ☆143 Updated last year
- Function dependencies resolution and execution ☆70 Updated 4 years ago
- Python REST client to interact with the Confluent Schema Registry server ☆176 Updated this week
- Utilities for creating ETL pipelines with mara ☆36 Updated 2 years ago
- Pylint plugin for static code analysis on Airflow code ☆94 Updated 4 years ago
- A web frontend for scheduling Jupyter notebook reports ☆252 Updated 5 months ago