Aiven-Labs / python-notebooks-for-apache-kafka
A Series of Notebooks on how to start with Kafka and Python
☆154Updated 2 months ago
Alternatives and similar repositories for python-notebooks-for-apache-kafka
Users that are interested in python-notebooks-for-apache-kafka are comparing it to the libraries listed below
Sorting:
- Project for "Data pipeline design patterns" blog.☆45Updated 9 months ago
- Example repo to create end to end tests for data pipeline.☆24Updated 11 months ago
- End to end data engineering project☆54Updated 2 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup☆87Updated 4 years ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆144Updated 4 years ago
- Docker Airflow - Contains a docker compose file for Airflow 2.0☆65Updated 2 years ago
- ☆34Updated last year
- Data pipeline that scrapes Rust cheater Steam profiles☆52Updated 3 years ago
- Code snippets for Data Engineering Design Patterns book☆106Updated last month
- Simple stream processing pipeline☆102Updated 10 months ago
- This repo contains commands that data engineers use in day to day work.☆60Updated 2 years ago
- Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO☆60Updated last year
- ☆107Updated 3 years ago
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 9 months ago
- Full stack data engineering tools and infrastructure set-up☆52Updated 4 years ago
- ☆87Updated 2 years ago
- Classwork projects and home works done through Udacity data engineering nano degree☆74Updated last year
- Simple ETL pipeline using Python☆26Updated last year
- Near real time ETL to populate a dashboard.☆72Updated 10 months ago
- ☆65Updated 2 weeks ago
- streaming eight subreddits from reddit api using kafka producer & spark structured streaming.☆19Updated last month
- The Python fake data producer for Apache Kafka® is a complete demo app allowing you to quickly produce JSON fake streaming datasets and …☆85Updated last year
- ☆26Updated 3 years ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated last year
- Code for dbt tutorial☆157Updated 11 months ago
- Code for "Advanced data transformations in SQL" free live workshop☆79Updated last week
- PySpark Cheatsheet☆36Updated 2 years ago
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development☆21Updated 5 years ago
- Repository for Apache Spark course at Team Data Science☆16Updated 4 years ago
- Here I will be exploring various tools and methods that are used in data engineering process with Python.☆22Updated 4 years ago