ikucan / pykafarr
A high-performance Python Kafka client. Efficiently from Kafka to Pandas and back.
☆39 Updated 5 years ago
Related projects:
- A Cookiecutter template for creating Faust projects quickly. ☆70 Updated last year
- SQL on dataframes - pandas and dask ☆64 Updated 6 years ago
- Pandas ExtensionDType/Array backed by Apache Arrow ☆229 Updated last year
- A consistent table management library in python ☆161 Updated last year
- Deploy dask on YARN clusters ☆69 Updated last month
- Python Rest Client to interact against Schema Registry confluent server ☆169 Updated this week
- ETLy is an add-on dashboard service on top of Apache Airflow. ☆69 Updated last year
- Derivatives models written with the Tributary data flow library ☆19 Updated 7 months ago
- Dockerized setup for testing code on realistic hadoop clusters ☆27 Updated 4 years ago
- A Python implementation of Apache Kafka Streams ☆314 Updated 5 years ago
- Function dependencies resolution and execution ☆71 Updated 4 years ago
- Shared-memory Python object namespace with Apache Plasma. Built because of Plotly Dash, useful anywhere. ☆84 Updated last year
- Spavro is a (sp)eedier avro library -- Spavro is a fork of the official Apache AVRO python 2 implementation with the goal of greatly impr… ☆26 Updated last year
- A kafka streams client library built on confluent-kafka-python ☆67 Updated 11 months ago
- DBAPI and SQLAlchemy dialect for Databricks Workspace and SQL Analytics clusters ☆22 Updated 2 years ago
- Fast, resilient and reproducible data analysis with cached SQL queries ☆30 Updated last year
- RedisTimeSeries python client ☆99 Updated last year
- Generate avro schemas from python classes. Code generation from avro schemas. Serialize/Deserialize python instances with avro schemas ☆213 Updated this week
- Blazing fast, composable, Pythonic quantile filters. ☆135 Updated last year
- A tool and library for easily deploying applications on Apache YARN ☆142 Updated 6 months ago
- Docker images for dask ☆231 Updated this week
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters. ☆70 Updated 2 years ago
- Ray provider for Apache Airflow ☆47 Updated 7 months ago
- Faust dockerized application ☆68 Updated last year
- Stream Processing Made Easy ☆38 Updated 2 years ago
- Distributed event processing for Python based on Redis Streams ☆133 Updated 4 years ago
- Python DataFrame with fast insert and appends ☆74 Updated last year
- Apache Avro <-> pandas DataFrame ☆134 Updated last month
- Fast iterative local development and testing of Apache Airflow workflows ☆192 Updated 3 months ago
- Helm charts for Dask ☆91 Updated last week