egehanyorulmaz / reference_etl
Tutorial for easy-to-manage data pipelines with Airflow
☆10 · Updated 2 years ago
Alternatives and similar repositories for reference_etl:
Users who are interested in reference_etl are comparing it to the repositories listed below.
- Example repo to create end to end tests for data pipeline. ☆22 · Updated 9 months ago
- Creating Data Pipelines with Apache Airflow to manage ETL from Amazon S3 into Amazon Redshift ☆14 · Updated 5 years ago
- ☆17 · Updated 9 months ago
- Example of an ETL Pipeline using Airflow ☆34 · Updated 7 years ago
- Full stack data engineering tools and infrastructure set-up ☆50 · Updated 4 years ago
- Series follows learning from Apache Spark (PySpark) with quick tips and workaround for daily problems in hand ☆47 · Updated last year
- ☆36 · Updated 2 years ago
- A Series of Notebooks on how to start with Kafka and Python ☆154 · Updated last month
- Python ETL demo for Hackforge ☆31 · Updated last year
- Content for a talk on "The wonderful world of data quality tools in Python" ☆19 · Updated 3 years ago
- Project for "Data pipeline design patterns" blog. ☆45 · Updated 7 months ago
- ☆17 · Updated 7 months ago
- Data lake, data warehouse on GCP ☆56 · Updated 3 years ago
- Built a stream processing data pipeline to get data from disparate systems into a dashboard using Kafka as an intermediary. ☆29 · Updated last year
- Source code for 'Building a Data Warehouse' by Vincent Rainardi ☆30 · Updated 7 years ago
- Public source code for the Udemy online course Apache Airflow: Complete Hands-On Beginner to Advanced Class. ☆63 · Updated 4 years ago
- ☆40 · Updated 8 months ago
- Spark data pipeline that processes movie ratings data. ☆28 · Updated this week
- Content related to Mastering Postgresql along with videos. ☆15 · Updated 3 years ago
- A repo to track data engineering projects ☆13 · Updated 2 years ago
- A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in … ☆21 · Updated 2 years ago
- Delta-Lake, ETL, Spark, Airflow ☆46 · Updated 2 years ago
- Simple ETL pipeline using Python ☆25 · Updated last year
- End-to-end ELT data engineering project ☆20 · Updated 2 years ago
- build dw with dbt ☆43 · Updated 5 months ago
- This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as staging… ☆82 · Updated 5 years ago
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development ☆21 · Updated 5 years ago
- Cost Efficient Data Pipelines with DuckDB ☆51 · Updated 7 months ago
- Code to demonstrate data engineering metadata & logging best practices ☆16 · Updated last year
- PySpark-ETL ☆23 · Updated 5 years ago