Aiven-Labs / python-notebooks-for-apache-kafka
A Series of Notebooks on how to start with Kafka and Python
☆154Updated last month
Alternatives and similar repositories for python-notebooks-for-apache-kafka:
Users that are interested in python-notebooks-for-apache-kafka are comparing it to the libraries listed below
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 8 months ago
- Full stack data engineering tools and infrastructure set-up☆51Updated 4 years ago
- PySpark functions and utilities with examples. Assists ETL process of data modeling☆101Updated 4 years ago
- Data pipeline that scrapes Rust cheater Steam profiles☆52Updated 3 years ago
- Code snippets for Data Engineering Design Patterns book☆80Updated last month
- Code for dbt tutorial☆156Updated 10 months ago
- Project for "Data pipeline design patterns" blog.☆45Updated 8 months ago
- ☆87Updated 2 years ago
- Docker Airflow - Contains a docker compose file for Airflow 2.0☆65Updated 2 years ago
- ☆34Updated last year
- Classwork projects and home works done through Udacity data engineering nano degree☆74Updated last year
- Example repo to create end to end tests for data pipeline.☆23Updated 10 months ago
- Template for Data Engineering and Data Pipeline projects☆109Updated 2 years ago
- Delta-Lake, ETL, Spark, Airflow☆47Updated 2 years ago
- The Python fake data producer for Apache Kafka® is a complete demo app allowing you to quickly produce JSON fake streaming datasets and …☆85Updated 11 months ago
- ☆40Updated 9 months ago
- Execution of DBT models using Apache Airflow through Docker Compose☆116Updated 2 years ago
- ☆28Updated last year
- Sample project to demonstrate data engineering best practices☆186Updated last year
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆143Updated 4 years ago
- Simple stream processing pipeline☆100Updated 10 months ago
- Step by step instructions to create a production-ready data pipeline☆45Updated 4 months ago
- The practical use-cases of how to make your Machine Learning Pipelines robust and reliable using Apache Airflow.☆52Updated 2 years ago
- Code for "Advanced data transformations in SQL" free live workshop☆77Updated 6 months ago
- Data lake, data warehouse on GCP☆56Updated 3 years ago
- ☆136Updated 2 years ago
- A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces.☆67Updated last year
- End to end data engineering project☆54Updated 2 years ago
- PySpark Cheatsheet☆36Updated 2 years ago
- Spark data pipeline that processes movie ratings data.☆28Updated 3 weeks ago