egehanyorulmaz / reference_etlLinks
Tutorial for easy-to-manage data pipelines with Airflow
☆10Updated 3 years ago
Alternatives and similar repositories for reference_etl
Users that are interested in reference_etl are comparing it to the libraries listed below
Sorting:
- Example repo to create end to end tests for data pipeline.☆25Updated last year
- ☆17Updated last year
- ☆10Updated 7 months ago
- Full stack data engineering tools and infrastructure set-up☆56Updated 4 years ago
- This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as staging…☆92Updated 6 years ago
- A repository of sample code to show data quality checking best practices using Airflow.☆78Updated 2 years ago
- Processing TfL data for bike usage with Google Cloud Platform.☆45Updated 3 years ago
- Data pipeline that scrapes Rust cheater Steam profiles☆54Updated 3 years ago
- Code for dbt tutorial☆159Updated 3 months ago
- Built a stream processing data pipeline to get data from disparate systems into a dashboard using Kafka as an intermediary.☆29Updated 2 years ago
- A Series of Notebooks on how to start with Kafka and Python☆152Updated 6 months ago
- A tutorial for the Great Expectations library.☆71Updated 4 years ago
- ☆40Updated 2 years ago
- ☆13Updated last month
- Project for "Data pipeline design patterns" blog.☆45Updated last year
- Content related to Mastering Postgresql along with videos.☆19Updated 4 years ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster☆28Updated 2 years ago
- A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in …☆23Updated 3 years ago
- ☆88Updated 2 years ago
- Template for Data Engineering and Data Pipeline projects☆114Updated 2 years ago
- Docker Airflow - Contains a docker compose file for Airflow 2.0☆68Updated 3 years ago
- Simple ETL pipeline using Python☆27Updated 2 years ago
- Sample pytest tests for testing SQL Server assests.☆46Updated 6 years ago
- A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for …☆138Updated 5 years ago
- From Pandas Dataframe To SQL Table using Psycopg2☆61Updated 3 years ago
- Snowflake Cookbook, published by Packt☆81Updated 2 years ago
- Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as …☆16Updated 5 years ago
- example code and source files for ficpa.org article "Programming for Efficiency"☆22Updated 8 years ago
- ☆45Updated 4 years ago
- Creating Data Pipelines with Apache Airflow to manage ETL from Amazon S3 into Amazon Redshift☆14Updated 6 years ago