Aiven-Labs / python-apache-kafka-tutorial
A tutorial to learn Apache Kafka and Python
☆23Updated 6 months ago
Alternatives and similar repositories for python-apache-kafka-tutorial:
Users that are interested in python-apache-kafka-tutorial are comparing it to the libraries listed below
- Full stack data engineering tools and infrastructure set-up☆51Updated 4 years ago
- DataHub on AWS demonstration resources☆10Updated 2 years ago
- ☆17Updated 8 months ago
- A curated list of awesome Databricks resources, including Spark☆17Updated 9 months ago
- ☆42Updated last month
- Streaming demo dbt☆17Updated 7 months ago
- ☆36Updated 2 years ago
- A flake8 plugin that detects of usage withColumn in a loop or inside reduce☆27Updated 3 months ago
- Realtime Data Engineering Project☆28Updated 3 months ago
- duckdb-etl-framework☆10Updated 4 months ago
- Utility functions for dbt projects running on Spark☆32Updated 2 months ago
- ☆17Updated 8 months ago
- Databricks ML in Action, Published by Packt☆30Updated 11 months ago
- Repository for Reference for Apache Iceberg LinkedIN Learning Courses☆11Updated 2 months ago
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce☆13Updated last year
- New generation opensource data stack☆67Updated 2 years ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated last year
- The Python fake data producer for Apache Kafka® is a complete demo app allowing you to quickly produce JSON fake streaming datasets and …☆85Updated 11 months ago
- Code snippets for Data Engineering Design Patterns book☆80Updated last month
- The Ultimate Guide to Snowpark, published by Packt☆14Updated 10 months ago
- Kafka Connect: How to create a real time data pipeline using Change Data Capture (CDC)☆13Updated 4 years ago
- Build & Learn Data Engineering,Machine Learning over Kubernetes. No Shortcut approach.☆57Updated 2 years ago
- Cost Efficient Data Pipelines with DuckDB☆51Updated 8 months ago
- Delta Lake Documentation☆49Updated 10 months ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster☆27Updated 2 years ago
- A package to run DuckDB queries from Apache Airflow.☆19Updated 10 months ago
- Snowflake - Build and Architect Data Pipelines using AWS, published by Packt☆20Updated 2 years ago
- Yet Another (Spark) ETL Framework☆20Updated last year
- A new Airflow Provider for Fivetran, maintained by Astronomer and Fivetran☆22Updated last week
- Web app using Python FastAPI backend, set up for deployment to Azure Container Apps with Azure PostgreSQL Flexible Server.☆12Updated 3 months ago