egehanyorulmaz / reference_etl
Tutorial for easy-to-manage data pipelines with Airflow
☆10Updated 3 years ago
Alternatives and similar repositories for reference_etl
Users that are interested in reference_etl are comparing it to the libraries listed below
- Example repo to create end to end tests for data pipeline.☆25Updated last year
- Code for dbt tutorial☆161Updated 2 months ago
- A Series of Notebooks on how to start with Kafka and Python☆152Updated 8 months ago
- Built a stream processing data pipeline to get data from disparate systems into a dashboard using Kafka as an intermediary.☆29Updated 2 years ago
- This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as staging…☆93Updated 6 years ago
- Create streaming data, transfer it to Kafka, modify it with PySpark, and send it to ElasticSearch and MinIO☆64Updated 2 years ago
- Project for "Data pipeline design patterns" blog.☆46Updated last year
- A repository of sample code to show data quality checking best practices using Airflow.☆78Updated 2 years ago
- ☆40Updated 2 years ago
- ☆13Updated 4 months ago
- Snowflake Cookbook, published by Packt☆81Updated 2 years ago
- ☆88Updated 3 years ago
- Delta-Lake, ETL, Spark, Airflow☆48Updated 3 years ago
- Sample pytest tests for testing SQL Server assets.☆46Updated 7 years ago
- Data lake, data warehouse on GCP☆56Updated 3 years ago
- A simple VS Code devcontainer setup for local PySpark development☆55Updated 2 years ago
- Simple stream processing pipeline☆110Updated last year
- Earthquakes API Live Data Project - Data Engineering Zoomcamp Project☆15Updated 2 years ago
- Processing TfL data for bike usage with Google Cloud Platform.☆46Updated 3 years ago
- Materials of the Official Helm Chart Webinar☆27Updated 4 years ago
- Data pipeline that scrapes Rust cheater Steam profiles☆54Updated 3 years ago
- Code snippets and tools published on the blog at lifearounddata.com☆12Updated 5 years ago
- Docker Airflow - Contains a docker compose file for Airflow 2.0☆69Updated 3 years ago
- Template for Data Engineering and Data Pipeline projects☆114Updated 2 years ago
- Sample project to demonstrate data engineering best practices☆198Updated last year
- This repo contains DAGs demonstrating a variety of ELT patterns using Airflow along with dbt.☆11Updated 2 years ago
- How to unit test your PySpark code☆29Updated 4 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup☆89Updated 4 years ago
- A Covid-19 data pipeline on AWS featuring PySpark/Glue, Docker, Great Expectations, Airflow, and Redshift, templated in CloudFormation an…☆23Updated last year
- Resources for video demonstrations and blog posts related to DataOps on AWS☆181Updated 3 years ago