egehanyorulmaz / reference_etlLinks
Tutorial for easy-to-manage data pipelines with Airflow
☆10Updated 3 years ago
Alternatives and similar repositories for reference_etl
Users that are interested in reference_etl are comparing it to the libraries listed below
Sorting:
- Example repo to create end to end tests for data pipeline.☆25Updated last year
- Creating Data Pipelines with Apache Airflow to manage ETL from Amazon S3 into Amazon Redshift☆14Updated 6 years ago
- A Series of Notebooks on how to start with Kafka and Python☆152Updated 9 months ago
- This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as staging…☆96Updated 6 years ago
- Price Crawler - Tracking Price Inflation☆187Updated 5 years ago
- ☆40Updated 2 years ago
- Code for dbt tutorial☆165Updated 2 months ago
- how to unit test your PySpark code☆29Updated 4 years ago
- A repository of sample code to show data quality checking best practices using Airflow.☆78Updated 2 years ago
- Udacity Data Streaming Nanodegree Program☆23Updated 4 years ago
- Project for "Data pipeline design patterns" blog.☆47Updated last year
- Simple stream processing pipeline☆110Updated last year
- Template for Data Engineering and Data Pipeline projects☆114Updated 2 years ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆158Updated 5 years ago
- ☆88Updated 3 years ago
- Spark, Airflow, Kafka☆25Updated 2 years ago
- End to end data engineering project☆57Updated 3 years ago
- Python ETL demo for Hackforge☆32Updated 2 years ago
- Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as …☆16Updated 6 years ago
- Near real time ETL to populate a dashboard.☆73Updated 2 months ago
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆106Updated 8 months ago
- Classwork projects and home works done through Udacity data engineering nano degree☆74Updated last year
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆42Updated 2 years ago
- Resources for video demonstrations and blog posts related to DataOps on AWS☆182Updated 3 years ago
- Sample project to demonstrate data engineering best practices☆200Updated last year
- Processing TfL data for bike usage with Google Cloud Platform.☆46Updated 3 years ago
- Docker Airflow - Contains a docker compose file for Airflow 2.0☆69Updated 3 years ago
- Sample pytest tests for testing SQL Server assests.☆46Updated 7 years ago
- Content related to Mastering Postgresql along with videos.☆18Updated 4 years ago
- Code for my "Efficient Data Processing in SQL" book.☆60Updated last year