ikucan / pykafarr
A high-performance Python Kafka client. Efficiently from Kafka to Pandas and back.
☆40 · Updated 6 years ago
Alternatives and similar repositories for pykafarr
Users that are interested in pykafarr are comparing it to the libraries listed below
- A Python implementation of Apache Kafka Streams ☆310 · Updated 6 years ago
- A Cookiecutter template for creating Faust projects quickly. ☆70 · Updated 2 years ago
- Streaming reactive and dataflow graphs in Python ☆458 · Updated 2 weeks ago
- Derivatives models written with the Tributary data flow library ☆24 · Updated last week
- A consistent table management library in python ☆160 · Updated 2 years ago
- Fast iterative local development and testing of Apache Airflow workflows ☆201 · Updated last month
- A Python framework for data processing on GCP. ☆119 · Updated 5 months ago
- Docker images for dask ☆242 · Updated last week
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with … ☆644 · Updated 3 weeks ago
- A kafka streams client library built on confluent-kafka-python ☆66 · Updated last year
- Pandas ExtensionDType/Array backed by Apache Arrow ☆232 · Updated 2 years ago
- MongoDB integrations for Apache Arrow. Export MongoDB documents to numpy array, parquet files, and pandas dataframes in one line of code. ☆112 · Updated last week
- Read Delta tables without any Spark ☆47 · Updated last year
- A web frontend for scheduling Jupyter notebook reports ☆254 · Updated 9 months ago
- A python wrapper for the KSQL REST API. ☆158 · Updated 2 years ago
- Python Rest Client to interact against Schema Registry confluent server ☆179 · Updated last week
- Native Kubernetes integration for Dask ☆323 · Updated 2 months ago
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters. ☆70 · Updated 3 years ago
- Distributed SQL Engine in Python using Dask ☆408 · Updated last year
- Deephaven Community Core ☆311 · Updated this week
- Event-driven data pipelines ☆144 · Updated last year
- Stream Processing using Polars ☆31 · Updated 2 years ago
- Generate avro schemas from python dataclasses, Pydantic models and Faust Records. Code generation from avro schemas. Serialize/Deserializ… ☆241 · Updated 2 weeks ago
- Deploy dask on YARN clusters ☆69 · Updated last year
- Fast Avro for Python ☆680 · Updated last week
- Pylint plugin for static code analysis on Airflow code ☆96 · Updated 4 years ago
- Distributed event processing for Python based on Redis Streams ☆135 · Updated 5 years ago
- Shared-memory Python object namespace with Apache Plasma. Built because of Plotly Dash, useful anywhere. ☆83 · Updated 8 months ago
- Jupyter Notebooks in S3 - Jupyter Contents Manager implementation ☆256 · Updated 2 weeks ago
- Python stream processing for analytics ☆40 · Updated 2 months ago