samsonafo / dtc_dezoomcamp_projectLinks
The goal of this project is to build an ETL pipeline. The data would be processed as a batch (monthly) between 2018-01 and 2021-02.
☆13Updated 3 years ago
Alternatives and similar repositories for dtc_dezoomcamp_project
Users that are interested in dtc_dezoomcamp_project are comparing it to the libraries listed below
Sorting:
- ☆35Updated 2 years ago
- ☆62Updated 3 years ago
- All Data Engineering notebooks from Datacamp course☆115Updated 5 years ago
- ☆32Updated 3 years ago
- The Repository for all code I use in my Data Science and Machine Learning Tutorials on YouTube☆75Updated 2 years ago
- Analysis of the Premier League games and seasons since 1992.☆26Updated 2 years ago
- YouTube tutorial project☆104Updated last year
- ☆21Updated last year
- Final Project of the MLOps Zoomcamp hosted by DataTalksClub.☆26Updated 2 years ago
- An end-to-end project on customer segmentation☆81Updated 2 years ago
- Data Engineer with Python lecture notes from #datacamp.☆46Updated 4 years ago
- PySpark functions and utilities with examples. Assists ETL process of data modeling☆103Updated 4 years ago
- Classwork projects and home works done through Udacity data engineering nano degree☆74Updated last year
- ☆87Updated 2 years ago
- This repo contains a list of questions to practice SQL with the Sakila Database.☆10Updated 2 years ago
- ☆21Updated 2 years ago
- ML Zoomcamp fall 2021 homework and stuff☆66Updated 3 years ago
- Business challenge that requires building a data platform for retailer data analytics.☆13Updated 2 years ago
- A Series of Notebooks on how to start with Kafka and Python☆154Updated 3 months ago
- ☆9Updated 2 years ago
- Simple ETL pipeline using Python☆26Updated 2 years ago
- Udacity Data Engineering Nanodegree Program☆52Updated 4 years ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆147Updated 5 years ago
- Data Quest - Data Engineer Learning and Projects☆24Updated 6 years ago
- Data Engineering on GCP☆35Updated 2 years ago
- Processing TfL data for bike usage with Google Cloud Platform.☆45Updated 2 years ago
- Solution to Data at ANZ virtual internship on Forage☆10Updated 4 years ago
- Book Projects☆24Updated 4 years ago
- A course by DataTalks Club that covers Spark, Kafka, Docker, Airflow, Terraform, DBT, Big Query etc☆13Updated 3 years ago
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath…☆21Updated 2 years ago