PhongHuynh0394 / Spotify-Analysis-with-PySparkLinks
Analyzing Spotify Data with Pyspark and ETL Procedures
☆23Updated last year
Alternatives and similar repositories for Spotify-Analysis-with-PySpark
Users that are interested in Spotify-Analysis-with-PySpark are comparing it to the libraries listed below
Sorting:
- ☆17Updated last year
- Nyc_Taxi_Data_Pipeline - DE Project☆133Updated last year
- A turnkey MLOps pipeline demonstrating how to go from raw events to real-time predictions at scale.☆232Updated 3 months ago
- Repo for learning DBT with Snowflake, featuring projects and models for data transformation and automation☆26Updated 10 months ago
- ELT Data Pipeline implementation in Data Warehousing environment☆30Updated 8 months ago
- Đồ án tốt nghiệp | Data Lakehouse☆34Updated last week
- ☆56Updated last year
- ☆10Updated last year
- ☆58Updated last year
- Thư viện sách của Xóm - Free & Public 😎☆178Updated 4 months ago
- ☆27Updated last year
- ☆28Updated 2 years ago
- Scalable, cloud-native recommender system with end-to-end MLOps for building, training, and deploying models in research and production☆52Updated 5 months ago
- Realtime Data Engineering Project☆30Updated last year
- ☆71Updated 2 years ago
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆310Updated 11 months ago
- Local Environment to Practice Data Engineering☆143Updated last year
- ☆55Updated 7 months ago
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆203Updated 2 years ago
- ☆23Updated last year
- On-premises ELT Pipeline☆31Updated 6 months ago
- My Setup Development Environment as Data Engineer☆34Updated 5 months ago
- Sample code and documentation for very basic things that I can't remember but want to aggregate in one place☆13Updated 4 years ago
- ☆53Updated 4 months ago
- Data Science Handbook☆291Updated 4 years ago
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra☆144Updated 2 years ago
- Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more☆373Updated 2 years ago
- Scalable Realtime Credit Card Fraud Detection (CCFD) system☆72Updated 3 months ago
- This project implements an ELT (Extract - Load - Transform) data pipeline with the goodreads dataset, using dagster (orchestration), spar…☆42Updated 2 years ago
- ☆63Updated last year