ikucan / pykafarr
A high-performance Python Kafka client. Efficiently from Kafka to Pandas and back.
☆41 · Updated 6 years ago
Alternatives and similar repositories for pykafarr
Users who are interested in pykafarr are comparing it to the libraries listed below.
- A consistent table management library in python ☆160 · Updated 2 years ago
- Derivatives models written with the Tributary data flow library ☆24 · Updated 2 weeks ago
- Apache Avro <-> pandas DataFrame ☆138 · Updated 4 months ago
- A Python implementation of Apache Kafka Streams ☆311 · Updated 6 years ago
- A Cookiecutter template for creating Faust projects quickly. ☆70 · Updated 3 years ago
- Streaming reactive and dataflow graphs in Python ☆459 · Updated last week
- Distributed event processing for Python based on Redis Streams ☆136 · Updated 5 years ago
- Python Rest Client to interact against Schema Registry confluent server ☆179 · Updated last month
- MongoDB integrations for Apache Arrow. Export MongoDB documents to numpy array, parquet files, and pandas dataframes in one line of code. ☆113 · Updated last week
- Pandas ExtensionDType/Array backed by Apache Arrow ☆232 · Updated 2 years ago
- DB API 2 interface for Flight SQL with SQLAlchemy extras. ☆43 · Updated 3 months ago
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with … ☆649 · Updated 2 weeks ago
- Distributed SQL Engine in Python using Dask ☆409 · Updated last year
- A Python framework for data processing on GCP. ☆120 · Updated 8 months ago
- A python wrapper for the KSQL REST API. ☆158 · Updated 2 years ago
- Fast iterative local development and testing of Apache Airflow workflows ☆202 · Updated last week
- Deephaven Community Core ☆330 · Updated last week
- RedisTimeSeries python client ☆99 · Updated 3 years ago
- A kafka streams client library built on confluent-kafka-python ☆66 · Updated 2 years ago
- Docker images for dask ☆244 · Updated 2 weeks ago
- Native Kubernetes integration for Dask ☆324 · Updated 2 months ago
- An extension for Jupyter Lab & Jupyter Notebook to monitor Apache Spark (pyspark) from notebooks ☆55 · Updated 6 months ago
- Jupyter Notebooks in S3 - Jupyter Contents Manager implementation ☆256 · Updated 2 weeks ago
- Deploy dask on YARN clusters ☆69 · Updated last year
- A tool and library for easily deploying applications on Apache YARN ☆145 · Updated last year
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters. ☆70 · Updated 3 years ago
- DBAPI and SQLAlchemy dialect for Databricks Workspace and SQL Analytics clusters ☆22 · Updated 3 years ago
- A web frontend for scheduling Jupyter notebook reports ☆254 · Updated last year
- Python DataFrame with fast insert and appends ☆75 · Updated last week
- DBND is an agile pipeline framework that helps data engineering teams track and orchestrate their data processes. ☆267 · Updated 9 months ago