samsonafo / dtc_dezoomcamp_project
The goal of this project is to build an ETL pipeline. The data is processed in monthly batches covering 2018-01 to 2021-02.
☆13 Updated 3 years ago
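As a rough illustration of the monthly batching described above, the sketch below enumerates the batch periods from 2018-01 to 2021-02 inclusive. The function name `monthly_batches` and the `YYYY-MM` label format are assumptions made for this example; they are not taken from the repository, and the project's actual scheduling may differ.

```python
from datetime import date
from typing import Iterator


def monthly_batches(start: date, end: date) -> Iterator[str]:
    """Yield 'YYYY-MM' labels for every month from start to end, inclusive.

    Hypothetical helper for illustration only; not part of the project.
    """
    year, month = start.year, start.month
    while (year, month) <= (end.year, end.month):
        yield f"{year:04d}-{month:02d}"
        # Roll over to January of the next year after December.
        year, month = (year + 1, 1) if month == 12 else (year, month + 1)


# One label per monthly batch in the range stated in the description.
batches = list(monthly_batches(date(2018, 1, 1), date(2021, 2, 1)))
```

Each label could then drive one extract-transform-load run, for example by selecting that month's source file.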
Alternatives and similar repositories for dtc_dezoomcamp_project:
Users that are interested in dtc_dezoomcamp_project are comparing it to the libraries listed below
- ☆33 Updated last year
- Python ETL demo for Hackforge ☆31 Updated last year
- ☆32 Updated 3 years ago
- This repo contains all code and data for WWCode Python DE workshop Aug 18 and 25 2022 ☆24 Updated 2 years ago
- ☆27 Updated last year
- ML Zoomcamp fall 2021 homework and stuff ☆64 Updated 3 years ago
- ☆87 Updated 2 years ago
- Working with Youtube's API to collect video statistics from a channel ☆52 Updated 3 years ago
- ☆21 Updated 2 years ago
- All Data Engineering notebooks from Datacamp course ☆115 Updated 5 years ago
- Processing TfL data for bike usage with Google Cloud Platform. ☆45 Updated 2 years ago
- An end-to-end project on customer segmentation ☆81 Updated 2 years ago
- Business challenge that requires building a data platform for retailer data analytics. ☆12 Updated 2 years ago
- Final Project of the MLOps Zoomcamp hosted by DataTalksClub. ☆26 Updated 2 years ago
- Classwork projects and homework done through the Udacity Data Engineering Nanodegree ☆74 Updated last year
- Analysis of the Premier League games and seasons since 1992. ☆26 Updated 2 years ago
- A quick reference guide to the most commonly used patterns and functions in PySpark SQL ☆54 Updated 3 years ago
- The repository contains all the work including projects, notes, and articles related to ML Engineering while I am learning. ☆10 Updated 2 years ago
- I have tried to solve some complex SQL interview questions that have been asked at several companies. Collected these questions from Ankit Ban… ☆98 Updated 2 years ago
- Case Studies from Danny Ma's Serious SQL Course ☆19 Updated 2 years ago
- ☆61 Updated 3 years ago
- This repo contains commands that data engineers use in day to day work. ☆60 Updated 2 years ago
- Data Pipeline from the Global Historical Climatology Network DataSet ☆27 Updated 2 years ago
- Data analytics interview questions and answers ☆60 Updated 4 years ago
- Data Engineer with Python lecture notes from #datacamp. ☆46 Updated 3 years ago
- This repo is meant to make it really easy to analyze the interplays between health and social media use. ☆43 Updated 2 years ago
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS ☆45 Updated 5 years ago
- In this project, we will build an ETL (Extract, Transform, Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro… ☆21 Updated last year
- IBM Data Engineering Courses from Coursera ☆71 Updated last year
- Essential PySpark for Scalable Data Analytics, published by Packt ☆43 Updated 2 years ago