iamraphson / IMDB-pipeline-projectLinks
☆16Updated last year
Alternatives and similar repositories for IMDB-pipeline-project
Users that are interested in IMDB-pipeline-project are comparing it to the libraries listed below
Sorting:
- Detailed notes and homeworks from 2025 Data Engineering Zoomcamp by Datatalks.Club☆56Updated 11 months ago
- ☆13Updated last year
- Realtime Data Engineering Project☆30Updated last year
- FInal project for data zoom camp 2024☆16Updated last year
- A custom end-to-end analytics platform for customer churn☆11Updated 8 months ago
- Code for "Advanced data transformations in SQL" free live workshop☆89Updated 9 months ago
- Django-based course management platform for Zoomcamps☆78Updated last week
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆98Updated last year
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆203Updated 2 years ago
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆312Updated 11 months ago
- Course Materials for Analytics in Stock Markets Zoomcamp☆826Updated 4 months ago
- Airflow 3 demos from DevRel☆80Updated 6 months ago
- Data Engineering project using Databricks PySpark & Spark SQL for analysing data from Spotify API and present in form of PowerBI report☆40Updated 2 months ago
- Code for "Efficient Data Processing in Spark" Course☆361Updated 3 months ago
- Sample repo for startdataengineering DE 101 free course☆74Updated last year
- ☆106Updated last year
- A Data Engineering project. Repository for backend infrastructure and Streamlit app files for a Premier League Dashboard.☆251Updated last month
- Une liste de projets data professionnels pour enrichir ton portfolio☆45Updated last month
- Data Engineering examples for Airflow, Prefect; dbt for BigQuery, Redshift, ClickHouse, Postgres, DuckDB; PySpark for Batch processing; K…☆69Updated this week
- End-to-end data pipeline that ingests, processes, and stores data. It uses Apache Airflow to schedule scripts that fetch data from an API…☆20Updated last year
- Practical Data Engineering: A Hands-On Real-Estate Project Guide☆767Updated last year
- This repository goes over how to handle massive variety in data engineering☆311Updated 3 years ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆202Updated last month
- This repository helps teach people how to correctly define and create cumulative tables!☆747Updated last year
- 🦆 Batch data pipeline with Airflow, DuckDB, Delta Lake, Trino, MinIO, and Metabase. Full observability and data quality.☆84Updated 3 months ago
- Code for my "Efficient Data Processing in SQL" book.☆60Updated last year
- Production ML rental prediction system.☆50Updated last year
- Data Engineering with Google Cloud Platform - Second Edition, published by Packt☆45Updated last year
- ☆127Updated last year
- Backup for NYC TLC data for the DE Zoomcamp course☆203Updated 3 years ago