benjigoldberg / udacity-airflow
Udacity Data Pipeline Exercises
☆15 Updated 5 years ago
Alternatives and similar repositories for udacity-airflow
Users who are interested in udacity-airflow are comparing it to the repositories listed below
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,… ☆90 Updated 3 years ago
- Just a boilerplate for PySpark and Flask ☆35 Updated 6 years ago
- AWS Big Data Certification ☆25 Updated 5 months ago
- Code to build a simple analytics data pipeline with Python ☆102 Updated 8 years ago
- Helping you get Airflow running in production. ☆9 Updated 5 years ago
- Glue VSCode devcontainer setup ☆14 Updated 2 years ago
- A real-time event pipeline around Kafka Ecosystem for Chicago Transit Authority. ☆31 Updated last year
- Source code for 'PySpark Recipes' by Raju Kumar Mishra ☆25 Updated 5 years ago
- Blog post on ETL pipelines with Airflow ☆23 Updated 5 years ago
- Udacity Data Engineering Nanodegree Projects ☆11 Updated 5 years ago
- Data Quest - Data Engineer Learning and Projects ☆24 Updated 6 years ago
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered… ☆16 Updated 6 years ago
- 🐍💨 Airflow tutorial for PyCon 2019 ☆86 Updated 2 years ago
- My solutions for the Udacity Data Engineering Nanodegree ☆34 Updated 5 years ago
- 🚨 Simple, self-contained fraud detection system built with Apache Kafka and Python ☆87 Updated 6 years ago
- ☆16 Updated 7 years ago
- Realtime social media data analytics with Apache Spark, Python, Kafka, Pandas, etc ☆51 Updated 8 years ago
- Public source code for the Batch Processing with Apache Beam (Python) online course ☆18 Updated 4 years ago
- Data engineering interviews Q&A for data community by data community ☆63 Updated 5 years ago
- Full stack data engineering tools and infrastructure set-up ☆53 Updated 4 years ago
- Code Repository for AWS Certified Big Data Specialty 2019 - In Depth and Hands On!, published by Packt ☆40 Updated last year
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u… ☆26 Updated 5 years ago
- Built a stream processing data pipeline to get data from disparate systems into a dashboard using Kafka as an intermediary. ☆29 Updated last year
- Data lake, data warehouse on GCP ☆56 Updated 3 years ago
- Various data stream/batch process demo with Apache Scala Spark 🚀 ☆11 Updated 5 years ago
- Sharing interesting and noteworthy Data Engineering content ☆68 Updated 8 years ago
- Here's how to get DataQuest's Data Engineering Track missions' content to work on your localhost. Using data from my Valenbisi ARIMA mode… ☆15 Updated 6 years ago
- sample code for tech blog post "Porting Flask to FastAPI for ML Model Serving" ☆28 Updated last year
- ☆18 Updated 3 years ago
- 💾 Script to import issues from a JIRA instance into a database. ☆56 Updated 2 years ago