samsonafo / dtc_dezoomcamp_project
The goal of this project is to build an ETL pipeline. The data is processed in monthly batches covering 2018-01 through 2021-02.
☆14 Updated 3 years ago
Alternatives and similar repositories for dtc_dezoomcamp_project
Users that are interested in dtc_dezoomcamp_project are comparing it to the libraries listed below
- All Data Engineering notebooks from Datacamp course ☆115 Updated 5 years ago
- ☆35 Updated 2 years ago
- ☆32 Updated 3 years ago
- I'm partaking in a Data Engineering Bootcamp / Zoomcamp. I'll store files and progress here. ☆108 Updated 3 years ago
- YouTube tutorial project ☆104 Updated 2 years ago
- Python ETL demo for Hackforge ☆32 Updated 2 years ago
- ML Zoomcamp fall 2021 homework and stuff ☆66 Updated 3 years ago
- Processing TfL data for bike usage with Google Cloud Platform. ☆45 Updated 3 years ago
- ☆88 Updated 3 years ago
- This repo contains commands that data engineers use in day-to-day work. ☆61 Updated 2 years ago
- Classwork projects and homework done through the Udacity Data Engineering Nanodegree ☆74 Updated last year
- Price Crawler - Tracking Price Inflation ☆188 Updated 5 years ago
- ☆360 Updated 2 years ago
- ☆206 Updated 2 years ago
- Simple ETL pipeline using Python ☆28 Updated 2 years ago
- I have tried to solve some complex SQL interview questions that have been asked at several companies. Collected this question from Ankit Ban… ☆100 Updated 3 years ago
- ☆29 Updated last year
- PySpark Tutorial for Beginners - Practical Examples in Jupyter Notebook with Spark version 3.4.1. The tutorial covers various topics like… ☆134 Updated 2 years ago
- In this project, we will build an ETL (Extract, Transform, Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro… ☆24 Updated 2 years ago
- Resources and projects from the Udacity Data Engineering with AWS Nanodegree programme ☆27 Updated 2 years ago
- IBM Data Engineering Courses from Coursera ☆71 Updated 2 years ago
- Leetcode SQL Solutions ☆187 Updated 2 years ago
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development ☆21 Updated 6 years ago
- This is the first project where we worked on Apache Spark. In this project, we downloaded the datasets from KAGG… ☆21 Updated 4 years ago
- Solutions to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift,… ☆57 Updated 2 years ago
- Data Engineering with AWS, Published by Packt ☆331 Updated 2 years ago
- PySpark functions and utilities with examples. Assists ETL process of data modeling ☆104 Updated 4 years ago
- Jupyter Notebook from Selenium Tutorial: Scraping Glassdoor.com ☆96 Updated 2 years ago
- ☆40 Updated 2 years ago
- PySpark Cheatsheet ☆36 Updated 2 years ago