ikucan / pykafarrLinks
A high-performance Python Kafka client. Efficiently from Kafka to Pandas and back.
☆40Updated 6 years ago
Alternatives and similar repositories for pykafarr
Users that are interested in pykafarr are comparing it to the libraries listed below
Sorting:
- A consistent table management library in python☆160Updated 2 years ago
- A Python implementation of Apache Kafka Streams☆310Updated 6 years ago
- Apache Avro <-> pandas DataFrame☆138Updated 2 months ago
- Streaming reactive and dataflow graphs in Python☆458Updated 3 weeks ago
- Pandas ExtensionDType/Array backed by Apache Arrow☆232Updated 2 years ago
- Derivatives models written with the Tributary data flow library☆24Updated last week
- Jupyter Notebooks in S3 - Jupyter Contents Manager implementation☆256Updated 2 months ago
- A Python framework for data processing on GCP.☆119Updated 7 months ago
- A Cookiecutter template for creating Faust projects quickly.☆70Updated 2 years ago
- Shared-memory Python object namespace with Apache Plasma. Built because of Plotly Dash, useful anywhere.☆83Updated 10 months ago
- Fast iterative local development and testing of Apache Airflow workflows☆202Updated 3 months ago
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆649Updated 2 weeks ago
- Distributed SQL Engine in Python using Dask☆408Updated last year
- SQL on dataframes - pandas and dask☆64Updated 7 years ago
- Distributed event processing for Python based on Redis Streams☆135Updated 5 years ago
- DBAPI and SQLAlchemy dialect for Databricks Workspace and SQL Analytics clusters☆22Updated 3 years ago
- A web frontend for scheduling Jupyter notebook reports☆254Updated 11 months ago
- Generate avro schemas from python dataclasses, Pydantic models and Faust Records. Code generation from avro schemas. Serialize/Deserializ…☆246Updated last week
- Python stream processing for analytics☆41Updated last month
- Python Rest Client to interact against Schema Registry confluent server☆179Updated 3 weeks ago
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters.☆70Updated 3 years ago
- Fast Avro for Python☆687Updated 2 weeks ago
- Deephaven Community Core☆321Updated last week
- Pylint plugin for static code analysis on Airflow code☆96Updated 5 years ago
- Deploy dask on YARN clusters☆69Updated last year
- Python DataFrame with fast insert and appends☆76Updated 2 weeks ago
- A tool and library for easily deploying applications on Apache YARN☆146Updated last year
- A kafka streams client library built on confluent-kafka-python☆66Updated 2 years ago
- Docker images for dask☆243Updated 2 weeks ago
- MongoDB integrations for Apache Arrow. Export MongoDB documents to numpy array, parquet files, and pandas dataframes in one line of code.☆111Updated last week