Learn the entire ETL process based on Spotify API data
☆268Feb 1, 2021Updated 5 years ago
Alternatives and similar repositories for free-data-engineering-course-for-beginners
Users that are interested in free-data-engineering-course-for-beginners are comparing it to the libraries listed below.
- Analysis of Prova Brasil 2011 open data with Airflow, S3, Redshift, and Metabase☆15Jun 28, 2023Updated 2 years ago
- Beginner data engineering project - batch edition☆581Apr 13, 2026Updated 2 weeks ago
- I am using a Confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti…☆29May 2, 2023Updated 2 years ago
- This is an all-in-one repository for Data Engineers, ideal for beginners & interview preparation, which includes Python as the main Progr…☆32Mar 21, 2023Updated 3 years ago
- Example end to end data engineering project.☆1,412Dec 8, 2022Updated 3 years ago
- ☆11Nov 9, 2022Updated 3 years ago
- Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake developme…☆1,903Aug 26, 2022Updated 3 years ago
- Classwork projects and homework done for the Udacity Data Engineering Nanodegree☆10Jun 6, 2021Updated 4 years ago
- ☆146Jan 31, 2023Updated 3 years ago
- A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for …☆139Apr 18, 2020Updated 6 years ago
- Near real time ETL to populate a dashboard.☆75Sep 9, 2025Updated 7 months ago
- Creating Lambda functions to ingest data from APIs with AWS CDK☆13Dec 1, 2021Updated 4 years ago
- In this repository you will find most of the code I write on my YouTube channel.☆62Jun 16, 2023Updated 2 years ago
- Personal Data Engineering Projects☆1,011Feb 8, 2023Updated 3 years ago
- An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.☆1,506Mar 9, 2020Updated 6 years ago
- Build machine learning models with scikit-learn power tools☆11Oct 28, 2022Updated 3 years ago
- Spark-based pipeline to extract and parse monthly games from the Lichess database.☆21Sep 22, 2025Updated 7 months ago
- Data Engineering Practice Problems☆2,653Jan 8, 2025Updated last year
- This repository contains analysis of IMDB data from multiple sources and analysis of movies/cast/box office revenues, movie brands and fr…☆31Jun 1, 2020Updated 5 years ago
- This repo consists of all important concepts for data engineers.☆11Dec 24, 2024Updated last year
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS☆52Aug 23, 2019Updated 6 years ago
- An Awesome List of Open-Source Data Engineering Projects☆3,166Oct 4, 2024Updated last year
- Code for Data Pipelines with Apache Airflow☆818Aug 15, 2024Updated last year
- Python ETL demo for Hackforge☆32Oct 11, 2023Updated 2 years ago
- Demonstrations and visualizations of sorting algorithms (Python and C++).☆22Oct 21, 2018Updated 7 years ago
- A Docker environment using Airflow with the Hadoop ecosystem (Hive, Spark, and Sqoop)☆12May 2, 2021Updated 4 years ago
- A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousin…☆15Apr 29, 2021Updated 5 years ago
- EtLT of my own Strava data using the Strava API, MySQL, Python, S3, Redshift, and Airflow☆34Jun 21, 2022Updated 3 years ago
- For this project I am creating an ETL (Extract, Transform, and Load) pipeline using Python, RegEx, and SQL Database. The goal is to retri…☆26Feb 9, 2021Updated 5 years ago
- A PySpark job to handle upserts, convert data to Parquet, and create partitions on S3☆27Jul 23, 2020Updated 5 years ago
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…☆268Jan 1, 2023Updated 3 years ago
- IRIS Data Classification using Streamlit package....☆10Dec 8, 2022Updated 3 years ago
- This is a template you can use for your next data engineering portfolio project.☆191Sep 10, 2021Updated 4 years ago
- Multithreading VS Multiprocessing in Python.☆50Dec 3, 2018Updated 7 years ago
- ☆340Aug 13, 2024Updated last year
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆169Dec 8, 2022Updated 3 years ago
- ☆11Mar 15, 2023Updated 3 years ago
- my favorite project☆17Jul 3, 2023Updated 2 years ago
- ☆10Jul 1, 2020Updated 5 years ago