airscholar / FootballDataEngineering
An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow and saves it on Azure Data Lake. Other processing takes place on Azure Data Factory, Azure Synapse and Tableau.
☆25Updated last year
Alternatives and similar repositories for FootballDataEngineering
Users that are interested in FootballDataEngineering are comparing it to the libraries listed below
Sorting:
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆136Updated last year
- Data Engineering YouTube Analysis Project by Darshil Parmar☆195Updated last year
- sql-for-data-engineering-course☆19Updated 2 years ago
- This repo contains all the code used in the Python for Data Engineering Course☆285Updated last year
- apache-spark-with-databricks-for-data-engineering☆83Updated 10 months ago
- YouTube tutorial project☆102Updated last year
- ☆151Updated 2 years ago
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆161Updated 2 years ago
- datacamp Data Engineer with Python course. 73 hours/ 19 Courses /2 Skill Assessments☆123Updated 2 years ago
- ☆195Updated last year
- Data Engineering Project with Hadoop HDFS and Kafka☆111Updated last year
- tokyo-olympic-azure-data-engineering-project☆207Updated 9 months ago
- This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science t…☆119Updated 3 months ago
- Transform data from on-premises SQL Server to Azure Delta Lake Storage for Analytics and Visualization☆10Updated last year
- This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.☆143Updated 9 months ago
- This is a template you can use for your next data engineering portfolio project.☆176Updated 3 years ago
- ☆22Updated 3 years ago
- ☆139Updated 2 years ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆129Updated 11 months ago
- ☆278Updated 9 months ago
- Azure Data Factory☆62Updated last month
- ☆18Updated last year
- For the Coursera specialization https://www.coursera.org/specializations/gcp-data-machine-learning☆94Updated 7 years ago
- Data Engineering Essentials☆19Updated 4 months ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆76Updated 11 months ago
- Sample project to demonstrate data engineering best practices☆190Updated last year
- Sample repo for startdataengineering DE 101 free course☆62Updated 10 months ago
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆22Updated 2 years ago
- Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more☆346Updated last year
- This project provides an end-to-end data processing and visualization of visa numbers in Japan using PySpark and Plotly. The spark cluste…☆11Updated last year