airscholar / FootballDataEngineeringLinks
An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow and saves it on Azure Data Lake. Other processing takes place on Azure Data Factory, Azure Synapse and Tableau.
☆23Updated last year
Alternatives and similar repositories for FootballDataEngineering
Users that are interested in FootballDataEngineering are comparing it to the libraries listed below
Sorting:
- This repo contains all the code used in the Python for Data Engineering Course☆300Updated last year
- Data Engineering YouTube Analysis Project by Darshil Parmar☆200Updated last year
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆150Updated last year
- ☆152Updated 3 years ago
- tokyo-olympic-azure-data-engineering-project☆214Updated last year
- apache-spark-with-databricks-for-data-engineering☆89Updated last year
- ☆284Updated 11 months ago
- datacamp Data Engineer with Python course. 73 hours/ 19 Courses /2 Skill Assessments☆125Updated 2 years ago
- ☆356Updated 6 months ago
- ☆142Updated 2 years ago
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆161Updated 2 years ago
- YouTube tutorial project☆105Updated last year
- ☆22Updated 3 years ago
- sql-for-data-engineering-course☆19Updated 2 years ago
- This is a template you can use for your next data engineering portfolio project.☆180Updated 3 years ago
- Data Engineering Project with Hadoop HDFS and Kafka☆114Updated last year
- I'm partaking in a Data Engineering Bootcamp / Zoomcamp. I'll store files and progress here.☆107Updated 3 years ago
- This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science t…☆126Updated 6 months ago
- Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more☆356Updated last year
- The resources of the preparation course for Databricks Data Engineer Associate certification exam☆459Updated last month
- This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.☆178Updated 11 months ago
- Sample repo for startdataengineering DE 101 free course☆69Updated last year
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…☆232Updated 2 years ago
- ☆203Updated last year
- A Data Engineering project. Repository for backend infrastructure and Streamlit app files for a Premier League Dashboard.☆246Updated last year
- Sample project to demonstrate data engineering best practices☆195Updated last year
- A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!☆728Updated 3 years ago
- Transform data from on-premises SQL Server to Azure Delta Lake Storage for Analytics and Visualization☆15Updated 2 years ago
- Data Engineering with Python, published by Packt☆727Updated 2 years ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆153Updated last year