airscholar / FootballDataEngineeringLinks
An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow and saves it on Azure Data Lake. Other processing takes place on Azure Data Factory, Azure Synapse and Tableau.
☆26Updated 2 years ago
Alternatives and similar repositories for FootballDataEngineering
Users that are interested in FootballDataEngineering are comparing it to the libraries listed below
Sorting:
- Data Engineering YouTube Analysis Project by Darshil Parmar☆207Updated last year
- datacamp Data Engineer with Python course. 73 hours/ 19 Courses /2 Skill Assessments☆134Updated 2 years ago
- This repo contains all the code used in the Python for Data Engineering Course☆320Updated last year
- apache-spark-with-databricks-for-data-engineering☆90Updated last year
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆159Updated 2 years ago
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆164Updated 2 years ago
- This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science t…☆135Updated 9 months ago
- YouTube tutorial project☆105Updated 2 years ago
- Data Engineering Project with Hadoop HDFS and Kafka☆118Updated last year
- ☆161Updated 3 years ago
- ☆296Updated last year
- Transform data from on-premises SQL Server to Azure Delta Lake Storage for Analytics and Visualization☆19Updated 2 years ago
- ☆375Updated 9 months ago
- ☆142Updated 2 years ago
- tokyo-olympic-azure-data-engineering-project☆215Updated last year
- This is a template you can use for your next data engineering portfolio project.☆181Updated 4 years ago
- For the Coursera specialization https://www.coursera.org/specializations/gcp-data-machine-learning☆97Updated 7 years ago
- This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.☆202Updated last year
- Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more☆365Updated last year
- ☆205Updated 2 years ago
- The resources of the preparation course for Databricks Data Engineer Associate certification exam☆507Updated last month
- ☆22Updated 4 years ago
- Personal Data Engineering Projects☆956Updated 2 years ago
- sql-for-data-engineering-course☆18Updated 2 years ago
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…☆242Updated 2 years ago
- Data Engineering with Python, published by Packt☆756Updated 2 years ago
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆285Updated 8 months ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆170Updated last month
- Data Engineering with AWS, Published by Packt☆332Updated 2 years ago
- Sample repo for startdataengineering DE 101 free course☆69Updated last year