PhongHuynh0394 / Spotify-Analysis-with-PySparkLinks
Analyzing Spotify Data with Pyspark and ETL Procedures
☆23Updated last year
Alternatives and similar repositories for Spotify-Analysis-with-PySpark
Users that are interested in Spotify-Analysis-with-PySpark are comparing it to the libraries listed below
Sorting:
- Nyc_Taxi_Data_Pipeline - DE Project☆129Updated last year
- Đồ án tốt nghiệp | Data Lakehouse☆26Updated last week
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆287Updated 8 months ago
- ☆55Updated last year
- A turnkey MLOps pipeline demonstrating how to go from raw events to real-time predictions at scale.☆224Updated 2 weeks ago
- Repo for learning DBT with Snowflake, featuring projects and models for data transformation and automation☆25Updated 7 months ago
- ELT Data Pipeline implementation in Data Warehousing environment☆28Updated 6 months ago
- ☆28Updated last year
- On-premises ELT Pipeline☆30Updated 3 months ago
- My Setup Development Environment as Data Engineer☆30Updated 3 months ago
- ☆17Updated last year
- ☆60Updated last year
- ☆10Updated last year
- ☆27Updated last year
- Thư viện sách của Xóm - Free & Public 😎☆170Updated last month
- Scalable, cloud-native recommender system with end-to-end MLOps for building, training, and deploying models in research and production☆51Updated 3 months ago
- ☆68Updated last year
- Sample code and documentation for very basic things that I can't remember but want to aggregate in one place☆14Updated 4 years ago
- ☆23Updated last year
- End-to-end Data Project (DA/DS/DE/MLOps) - retail/e-commerce - interpretable dynamic clustering☆15Updated 3 months ago
- ☆57Updated 4 months ago
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra☆143Updated 2 years ago
- Local Environment to Practice Data Engineering☆141Updated 10 months ago
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆164Updated 2 years ago
- FInal project for data zoom camp 2024☆16Updated last year
- Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more☆364Updated last year
- ☆63Updated last year
- End-to-end data platform: A PoC Data Platform project utilizing modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore,…☆46Updated last year
- Production ML rental prediction system.☆49Updated last year
- Welcome to my data engineering projects repository! Here you will find a collection of data engineering projects that I have worked on.☆22Updated 2 years ago