airscholar / FootballDataEngineeringLinks
An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow and saves it on Azure Data Lake. Other processing takes place on Azure Data Factory, Azure Synapse and Tableau.
☆27Updated 2 years ago
Alternatives and similar repositories for FootballDataEngineering
Users that are interested in FootballDataEngineering are comparing it to the libraries listed below
Sorting:
- Data Engineering YouTube Analysis Project by Darshil Parmar☆209Updated last year
- This repo contains all the code used in the Python for Data Engineering Course☆321Updated last year
- tokyo-olympic-azure-data-engineering-project☆216Updated last year
- apache-spark-with-databricks-for-data-engineering☆90Updated last year
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆166Updated 2 years ago
- datacamp Data Engineer with Python course. 73 hours/ 19 Courses /2 Skill Assessments☆134Updated 2 years ago
- ☆297Updated last year
- Data Engineering Project with Hadoop HDFS and Kafka☆118Updated 2 years ago
- This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.☆204Updated last year
- This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science t…☆137Updated 10 months ago
- YouTube tutorial project☆105Updated 2 years ago
- This is a template you can use for your next data engineering portfolio project.☆181Updated 4 years ago
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆164Updated 2 years ago
- ☆161Updated 3 years ago
- I'm partaking in a Data Engineering Bootcamp / Zoomcamp. I'll store files and progress here.☆108Updated 3 years ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆173Updated 2 months ago
- Azure Data Factory☆68Updated 3 months ago
- The resources of the preparation course for Databricks Data Engineer Associate certification exam☆522Updated 2 months ago
- Sample repo for startdataengineering DE 101 free course☆71Updated last year
- ☆142Updated 2 years ago
- sql-for-data-engineering-course☆18Updated 2 years ago
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆25Updated 2 years ago
- ☆376Updated 9 months ago
- Transform data from on-premises SQL Server to Azure Delta Lake Storage for Analytics and Visualization☆18Updated 2 years ago
- ☆81Updated 7 months ago
- For the Coursera specialization https://www.coursera.org/specializations/gcp-data-machine-learning☆97Updated 7 years ago
- ☆208Updated 2 years ago
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…☆242Updated 2 years ago
- Data Engineering with Python, published by Packt☆760Updated 2 years ago
- Data Engineering project using Databricks PySpark & Spark SQL for analysing data from Spotify API and present in form of PowerBI report☆31Updated 9 months ago