padilha / nyc-motor-vehicle-collisions
My final project for the Data Engineering Zoomcamp by DataTalksClub.
☆11Updated last year
Alternatives and similar repositories for nyc-motor-vehicle-collisions:
Users that are interested in nyc-motor-vehicle-collisions are comparing it to the libraries listed below
- Code for the Data Engineering Zoomcamp☆47Updated last year
- My notes of the Data Engineering Zoomcamp by DataTalksClub☆36Updated last year
- This repository will contain all of the resources for the Mage component of the Data Engineering Zoomcamp: https://github.com/DataTalksCl…☆98Updated 6 months ago
- ☆41Updated last year
- Project submission for data engineering zoomcamp 2023 - https://github.com/DataTalksClub/data-engineering-zoomcamp☆8Updated last year
- ☆28Updated last year
- Final Project of the MLOps Zoomcamp hosted by DataTalksClub.☆26Updated 2 years ago
- Backup for NYC TLC data for the DE Zoomcamp course☆171Updated 2 years ago
- ☆32Updated last year
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆65Updated 8 months ago
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆79Updated 6 months ago
- ☆339Updated last year
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆256Updated 7 months ago
- ☆135Updated 2 years ago
- This is a template you can use for your next data engineering portfolio project.☆173Updated 3 years ago
- Candace's Data Engineering Zoomcamp files and notes☆18Updated last year
- Project for "Data pipeline design patterns" blog.☆43Updated 6 months ago
- Sample repo for startdataengineering DE 101 free course☆48Updated 7 months ago
- Sample project to demonstrate data engineering best practices☆179Updated 11 months ago
- PySpark Tutorial for Beginners - Practical Examples in Jupyter Notebook with Spark version 3.4.1. The tutorial covers various topics like…☆100Updated last year
- Surfalytics projces on Data Engineering and Analytics☆62Updated 2 weeks ago
- End to end data engineering project☆53Updated 2 years ago
- The resources of the preparation course for Databricks Data Engineer Professional certification exam☆102Updated this week
- ☆37Updated last year
- ☆72Updated last year
- Classifies kitchen stuff items into 6 categories: cups, glasses, plates, spoons, forks and knives☆19Updated 2 years ago
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆34Updated last year
- Simple ETL pipeline using Python☆25Updated last year
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆137Updated 4 years ago
- Data pipeline for uploading, preprocessing, and visualising COVID19 data☆18Updated last year