airscholar / FootballDataEngineering
An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow and saves it on Azure Data Lake. Other processing takes place on Azure Data Factory, Azure Synapse and Tableau.
☆25Updated last year
Alternatives and similar repositories for FootballDataEngineering
Users who are interested in FootballDataEngineering are comparing it to the repositories listed below
- sql-for-data-engineering-course☆19Updated 2 years ago
- Data Engineering YouTube Analysis Project by Darshil Parmar☆195Updated last year
- apache-spark-with-databricks-for-data-engineering☆84Updated 11 months ago
- This repo contains all the code used in the Python for Data Engineering Course☆289Updated last year
- ☆139Updated 2 years ago
- ☆22Updated 3 years ago
- YouTube tutorial project☆103Updated last year
- tokyo-olympic-azure-data-engineering-project☆209Updated 10 months ago
- ☆150Updated 3 years ago
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆139Updated last year
- ☆19Updated last year
- In this project, we set up end-to-end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …☆30Updated last year
- data-warehouse-snowflake-for-data-engineering☆17Updated last year
- DataCamp Data Engineer with Python course. 73 hours / 19 Courses / 2 Skill Assessments☆124Updated 2 years ago
- Azure Data Factory☆63Updated 2 months ago
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆161Updated 2 years ago
- This is a template you can use for your next data engineering portfolio project.☆176Updated 3 years ago
- Git Repository☆140Updated 3 months ago
- In this project, we will build an ETL (Extract, Transform, Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆22Updated 2 years ago
- Cool DE Projects☆28Updated last week
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆146Updated 4 years ago
- Learn PySpark from Basics to Advanced. Check out the YouTube Series: [PySpark - Zero to Hero]☆57Updated 4 months ago
- This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science t…☆123Updated 4 months ago
- ☆22Updated last year
- ☆279Updated 9 months ago
- ☆197Updated last year
- End-to-end data engineering project with Kafka, Airflow, Spark, Postgres and Docker.☆95Updated 2 months ago
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆252Updated 3 months ago
- Sample repo for startdataengineering DE 101 free course☆62Updated 11 months ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆144Updated last year