mtpatter / time-series-kafka-demo
Fully reproducible, Dockerized, step-by-step tutorial on how to mock a "real-time" Kafka data stream from a timestamped CSV file. A detailed blog post was published on Towards Data Science.
☆39 Updated 3 years ago
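The core idea in the description, replaying a timestamped CSV file as if it were arriving live, can be illustrated with a minimal sketch. This is not the repository's code: the column names (`timestamp`, `value`), the timestamp format, the `speed` parameter, and the `send` callback are all assumptions made for illustration. In a real setup, `send` would presumably wrap a Kafka producer; here it is any callable, so the sketch runs with the standard library alone.

```python
import csv
import io
import time
from datetime import datetime


def replay_delays(rows, timestamp_field="timestamp", speed=1.0,
                  fmt="%Y-%m-%d %H:%M:%S"):
    """Yield (delay_seconds, row) pairs.

    The delay is the gap between a row's timestamp and the previous
    row's timestamp, divided by `speed` (hypothetical speed-up factor).
    """
    previous = None
    for row in rows:
        current = datetime.strptime(row[timestamp_field], fmt)
        if previous is None:
            delay = 0.0  # first row is emitted immediately
        else:
            delay = (current - previous).total_seconds() / speed
        previous = current
        # Guard against out-of-order timestamps producing negative sleeps.
        yield max(delay, 0.0), row


def stream(rows, send, sleep=time.sleep, **kwargs):
    """Wait out each delay, then pass the row to `send`.

    `send` is a placeholder for whatever publishes the message,
    for example a thin wrapper around a Kafka producer.
    """
    for delay, row in replay_delays(rows, **kwargs):
        sleep(delay)
        send(row)


# Hypothetical sample data; the real tutorial's CSV layout may differ.
SAMPLE_CSV = """timestamp,value
2021-01-01 00:00:00,1.0
2021-01-01 00:00:02,1.5
2021-01-01 00:00:05,0.7
"""

if __name__ == "__main__":
    reader = csv.DictReader(io.StringIO(SAMPLE_CSV))
    # Replay ten times faster than the recorded gaps, printing each row.
    stream(reader, send=print, speed=10.0)
```

Passing `sleep` as a parameter keeps the timing logic testable without actually waiting, and the `speed` factor lets long recordings be replayed quickly.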
Alternatives and similar repositories for time-series-kafka-demo:
Users who are interested in time-series-kafka-demo are comparing it to the libraries listed below:
- Delta-Lake, ETL, Spark, Airflow ☆46 Updated 2 years ago
- This is my Apache Airflow Local development setup on Windows 10 WSL2/Mac using docker-compose. It will also include some sample DAGs and … ☆28 Updated last year
- build dw with dbt ☆43 Updated 5 months ago
- ☆12 Updated 3 years ago
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks ☆23 Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up ☆50 Updated 4 years ago
- A Covid-19 data pipeline on AWS featuring PySpark/Glue, Docker, Great Expectations, Airflow, and Redshift, templated in CloudFormation an… ☆23 Updated last year
- End to end data engineering project ☆53 Updated 2 years ago