ETM1123 / divvy-data-pipelineLinks
☆10Updated 2 years ago
Alternatives and similar repositories for divvy-data-pipeline
Users that are interested in divvy-data-pipeline are comparing it to the libraries listed below
Sorting:
- Final Project of the MLOps Zoomcamp hosted by DataTalksClub.☆26Updated 2 years ago
- ☆36Updated 2 years ago
- ☆12Updated 4 years ago
- Exercises performed as part of the ML Zoomcamp course☆30Updated 3 years ago
- ☆29Updated 2 years ago
- ML Zoomcamp fall 2021 homework and stuff☆66Updated 3 years ago
- ☆88Updated 3 years ago
- ELT for AEMET weather data.☆16Updated 7 months ago
- an end-to-end data pipeline extracting music listening habits and producing an insightful dashboard☆17Updated last year
- Data Engineering Project in GCP☆21Updated 2 years ago
- Awesome list of resources for analytics engineers☆29Updated 3 years ago
- Classifies kitchen stuff items into 6 categories: cups, glasses, plates, spoons, forks and knives☆19Updated 2 years ago
- Predict the number of deaths due to covid19 in the next two weeks☆11Updated 3 years ago
- A project from the ml_ops Zoomcamp (DataTalks) using Semiconductor data☆22Updated 3 years ago
- Sample project to demonstrate data engineering best practices☆198Updated last year
- Simple stream processing pipeline☆110Updated last year
- Data Engineering examples for Airflow, Prefect; dbt for BigQuery, Redshift, ClickHouse, Postgres, DuckDB; PySpark for Batch processing; K…☆68Updated last week
- A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from loc…☆23Updated 3 years ago
- ☆40Updated 2 years ago
- Hey this is the repo that has all the queries and data for my video game training series!☆154Updated 3 years ago
- Code for my "Efficient Data Processing in SQL" book.☆60Updated last year
- PySpark functions and utilities with examples. Assists ETL process of data modeling☆104Updated 4 years ago
- Data pipeline that scrapes Rust cheater Steam profiles☆54Updated 3 years ago
- Can we predict how much health insurance will cost using regression?☆11Updated 3 years ago
- ☆44Updated last year
- ☆21Updated 2 years ago
- PySpark Cheatsheet☆36Updated 2 years ago
- PipeRider dbt workshop for DataTalksClub DE Zoomcamp☆18Updated last year
- ☆32Updated 3 years ago
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆42Updated last year