ikucan / pykafarrLinks
A high-performance Python Kafka client. Efficiently from Kafka to Pandas and back.
☆41Updated 6 years ago
Alternatives and similar repositories for pykafarr
Users that are interested in pykafarr are comparing it to the libraries listed below
Sorting:
- Distributed SQL Engine in Python using Dask☆408Updated last year
- Streaming reactive and dataflow graphs in Python☆458Updated 2 weeks ago
- A Python implementation of Apache Kafka Streams☆310Updated 6 years ago
- A consistent table management library in python☆160Updated 2 years ago
- Derivatives models written with the Tributary data flow library☆24Updated 2 weeks ago
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆649Updated last week
- Pandas ExtensionDType/Array backed by Apache Arrow☆232Updated 2 years ago
- A Python framework for data processing on GCP.☆120Updated 8 months ago
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.☆301Updated last year
- Deploy dask on YARN clusters☆69Updated last year
- Docker images for dask☆244Updated last month
- Apache Avro <-> pandas DataFrame☆138Updated 3 months ago
- A Cookiecutter template for creating Faust projects quickly.☆70Updated 3 years ago
- Native Kubernetes integration for Dask☆323Updated last month
- MongoDB integrations for Apache Arrow. Export MongoDB documents to numpy array, parquet files, and pandas dataframes in one line of code.☆112Updated last week
- Deephaven Community Core☆328Updated last week
- Real-time stream processing for python☆1,286Updated last year
- A tool and library for easily deploying applications on Apache YARN☆146Updated last year
- Python stream processing for analytics☆41Updated this week
- Distributed event processing for Python based on Redis Streams☆135Updated 5 years ago
- A kafka streams client library built on confluent-kafka-python☆66Updated 2 years ago
- A fast PostgreSQL Database Client Library for Python/asyncio.☆46Updated last year
- Fast iterative local development and testing of Apache Airflow workflows☆202Updated 3 months ago
- Apache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful …☆145Updated last year
- Quickly ingest messy CSV and XLS files. Export to clean pandas, SQL, parquet☆197Updated 2 years ago
- Monitor Apache Spark from Jupyter Notebook☆172Updated 3 years ago
- Pylint plugin for static code analysis on Airflow code☆96Updated 5 years ago
- A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you…☆258Updated this week
- Python Rest Client to interact against Schema Registry confluent server☆179Updated 2 weeks ago
- A web frontend for scheduling Jupyter notebook reports☆254Updated last year