PhongHuynh0394 / Spotify-Analysis-with-PySpark
Analyzing Spotify Data with PySpark and ETL Procedures
☆23 Updated 11 months ago
Alternatives and similar repositories for Spotify-Analysis-with-PySpark
Users who are interested in Spotify-Analysis-with-PySpark are comparing it to the repositories listed below
- Nyc_Taxi_Data_Pipeline - DE Project ☆120 Updated 10 months ago
- Graduation project | Data Lakehouse ☆22 Updated last year
- ELT Data Pipeline implementation in Data Warehousing environment ☆26 Updated 4 months ago
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka… ☆273 Updated 6 months ago
- ☆18 Updated last year
- An example project demonstrating data engineering workflow for tutorial purposes by Pipeline To Insights. ☆10 Updated 4 months ago
- A turnkey MLOps pipeline demonstrating how to go from raw events to real-time predictions at scale. ☆214 Updated 7 months ago
- ☆54 Updated last year
- ☆68 Updated last year
- ☆28 Updated last year
- ☆29 Updated last year
- Scalable, cloud-native recommender system with end-to-end MLOps for building, training, and deploying models in research and production ☆40 Updated last month
- My Setup Development Environment as Data Engineer ☆29 Updated last month
- ☆61 Updated last year
- Crawl data from the TIKI e-commerce, designing a data warehouse, implementing an ETL (Extract, Transform, Load) process, and loading the … ☆16 Updated 2 years ago
- Realtime Data Engineering Project ☆30 Updated 7 months ago
- Sample code and documentation for very basic things that I can't remember but want to aggregate in one place ☆14 Updated 3 years ago
- ☆24 Updated last year
- Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more ☆358 Updated last year
- Local Environment to Practice Data Engineering ☆143 Updated 8 months ago
- Xóm's book library - Free & Public 😎 ☆149 Updated 3 weeks ago
- Repo for learning DBT with Snowflake, featuring projects and models for data transformation and automation ☆24 Updated 5 months ago
- On-premises ELT Pipeline ☆28 Updated last month
- ☆54 Updated 2 months ago
- End-to-end data engineering project with Kafka, Airflow, Spark, Postgres, and Docker. ☆102 Updated 5 months ago
- Open Source LeetCode for PySpark, Spark, Pandas and DBT/Snowflake ☆205 Updated 2 months ago
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh… ☆153 Updated last year
- ☆84 Updated 7 months ago
- This repository will contain all of the resources for the Mage component of the Data Engineering Zoomcamp: https://github.com/DataTalksCl… ☆101 Updated last year
- ☆90 Updated 7 months ago