ikucan / pykafarrLinks
A high-performance Python Kafka client. Efficiently from Kafka to Pandas and back.
☆40Updated 6 years ago
Alternatives and similar repositories for pykafarr
Users that are interested in pykafarr are comparing it to the libraries listed below
Sorting:
- Derivatives models written with the Tributary data flow library☆24Updated this week
- A consistent table management library in python☆160Updated 2 years ago
- A Cookiecutter template for creating Faust projects quickly.☆70Updated 2 years ago
- Pandas ExtensionDType/Array backed by Apache Arrow☆232Updated 2 years ago
- Docker images for dask☆243Updated 2 weeks ago
- Streaming reactive and dataflow graphs in Python☆458Updated this week
- A Python implementation of Apache Kafka Streams☆310Updated 6 years ago
- Distributed SQL Engine in Python using Dask☆408Updated last year
- Python Rest Client to interact against Schema Registry confluent server☆179Updated last week
- Pylint plugin for static code analysis on Airflow code☆96Updated 5 years ago
- Fast iterative local development and testing of Apache Airflow workflows☆201Updated 2 months ago
- Apache Avro <-> pandas DataFrame☆138Updated 2 months ago
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters.☆70Updated 3 years ago
- Deploy dask on YARN clusters☆69Updated last year
- A python wrapper for the KSQL REST API.☆158Updated 2 years ago
- Dockerized setup for testing code on realistic hadoop clusters☆27Updated 5 years ago
- Native Kubernetes integration for Dask☆322Updated 2 weeks ago
- A Python framework for data processing on GCP.☆119Updated 6 months ago
- A web frontend for scheduling Jupyter notebook reports☆254Updated 11 months ago
- MongoDB integrations for Apache Arrow. Export MongoDB documents to numpy array, parquet files, and pandas dataframes in one line of code.☆111Updated 2 weeks ago
- A kafka streams client library built on confluent-kafka-python☆66Updated 2 years ago
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆647Updated last week
- A tool and library for easily deploying applications on Apache YARN☆146Updated last year
- Python stream processing for analytics☆41Updated last month
- A toolkit providing a uniform interface for connecting to and extracting data from a wide variety of (potentially remote) data stores (in…☆254Updated 3 months ago
- Deephaven Community Core☆317Updated last week
- Python DataFrame with fast insert and appends☆75Updated 2 months ago
- Fast, resilient and reproducible data analysis with cached SQL queries☆30Updated 2 years ago
- SQL on dataframes - pandas and dask☆64Updated 7 years ago
- Airflow declarative DAGs via YAML☆133Updated 2 years ago