karolina-sowinska / free-data-engineering-course-for-beginners
Learn the entire ETL process based on Spotify API data
☆268Feb 1, 2021Updated 5 years ago
Alternatives and similar repositories for free-data-engineering-course-for-beginners
Users who are interested in free-data-engineering-course-for-beginners are comparing it to the repositories listed below
- This project involves an ETL (Extract, Transform, Load) process to analyze sleep data exported from Apple Health☆29Apr 29, 2023Updated 2 years ago
- Building a Data Pipeline with Astro Python SDK, dbt & Apache Airflow☆10Mar 20, 2024Updated last year
- ☆13Jun 12, 2024Updated last year
- Analysis of Open Data from Prova Brasil 2011 with Airflow, S3, Redshift and Metabase☆15Jun 28, 2023Updated 2 years ago
- Beginner data engineering project - batch edition☆564Jan 22, 2025Updated last year
- I am using a Confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti…☆29May 2, 2023Updated 2 years ago
- Data Engineering Hours With Experts Coding Challenge☆12Sep 14, 2023Updated 2 years ago
- Class content from cohort 6 of How's data engineering bootcamp☆12Sep 16, 2021Updated 4 years ago
- Classwork projects and homework completed for the Udacity Data Engineering Nanodegree☆10Jun 6, 2021Updated 4 years ago
- Example end to end data engineering project.☆1,384Dec 8, 2022Updated 3 years ago
- Data Engineering with Python, published by Packt☆783Jan 30, 2023Updated 3 years ago
- ☆147Jan 31, 2023Updated 3 years ago
- Your Top Spotify Listening Habits, Favorite Artists, and Song Recommendations in a Playlist🎧🎶☆19May 19, 2025Updated 8 months ago
- Build machine learning models with scikit-learn power tools☆11Oct 28, 2022Updated 3 years ago
- This is an all-in-one repository for Data Engineers, ideal for beginners & interview preparation, which includes Python as the main Progr…☆31Mar 21, 2023Updated 2 years ago
- A few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake developme…☆1,814Aug 26, 2022Updated 3 years ago
- This repo contains commands that data engineers use in day-to-day work.☆61Feb 4, 2023Updated 3 years ago
- Python ETL demo for Hackforge☆32Oct 11, 2023Updated 2 years ago
- Data Visualizations for New York City☆11Apr 15, 2020Updated 5 years ago
- Creating Lambda Functions to Ingest Data from APIs with AWS CDK☆13Dec 1, 2021Updated 4 years ago
- Logging in with Scrapy☆14Jan 26, 2018Updated 8 years ago
- Solution to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift,…☆57Oct 20, 2022Updated 3 years ago
- Personal Data Engineering Projects☆989Feb 8, 2023Updated 3 years ago
- Data Engineering YouTube Analysis Project by Darshil Parmar☆227Dec 8, 2023Updated 2 years ago
- Lecture notes, lab notes, and links to helpful resources to pass Google Certification Exam for Professional Data Engineer.☆18Aug 12, 2022Updated 3 years ago
- Analysis of SQL Leetcode and classic interview questions. Common pitfalls, anti-patterns and handy tricks are discussed. Sample databases…☆47Sep 5, 2021Updated 4 years ago
- A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousin…☆15Apr 29, 2021Updated 4 years ago
- Near real time ETL to populate a dashboard.☆73Sep 9, 2025Updated 5 months ago
- Data engineering interviews Q&A for data community by data community☆66Jun 7, 2020Updated 5 years ago
- An Awesome List of Open-Source Data Engineering Projects☆3,016Oct 4, 2024Updated last year
- ☆21Nov 21, 2023Updated 2 years ago
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…☆245Jan 1, 2023Updated 3 years ago
- A simple bot to answer questions on my personal website. (In development)☆42May 28, 2021Updated 4 years ago
- A PySpark job to handle upserts, convert to Parquet, and create partitions on S3☆28Jul 23, 2020Updated 5 years ago
- Statistics and Probability with Python for Everyone☆19Nov 10, 2019Updated 6 years ago
- A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for …☆141Apr 18, 2020Updated 5 years ago
- For this project I am creating an ETL (Extract, Transform, and Load) pipeline using Python, RegEx, and SQL Database. The goal is to retri…☆25Feb 9, 2021Updated 5 years ago
- Multithreading VS Multiprocessing in Python.☆50Dec 3, 2018Updated 7 years ago
- Code for Data Pipelines with Apache Airflow☆812Aug 15, 2024Updated last year