airscholar / FootballDataEngineering
An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow and saves it on Azure Data Lake. Other processing takes place on Azure Data Factory, Azure Synapse and Tableau.
☆18Updated last year
Alternatives and similar repositories for FootballDataEngineering:
Users that are interested in FootballDataEngineering are comparing it to the libraries listed below
- ☆145Updated 2 years ago
- Data Engineering YouTube Analysis Project by Darshil Parmar☆172Updated last year
- ☆130Updated 2 years ago
- Sample repo for startdataengineering DE 101 free course☆45Updated 7 months ago
- Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more☆317Updated last year
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆223Updated last year
- Sample project to demonstrate data engineering best practices☆175Updated 11 months ago
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆109Updated last year
- ☆28Updated last year
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆254Updated 6 months ago
- This repo contains all the code used in the Python for Data Engineering Course☆244Updated 9 months ago
- The resources of the preparation course for Databricks Data Engineer Associate certification exam☆326Updated last month
- This is a template you can use for your next data engineering portfolio project.☆174Updated 3 years ago
- ☆16Updated 6 months ago
- YouTube tutorial project☆99Updated last year
- This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.☆110Updated 5 months ago
- In this project, we setup and end to end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …☆25Updated last year
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…☆215Updated 2 years ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆63Updated 7 months ago
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆76Updated 5 months ago
- Generate synthetic Spotify music stream dataset to create dashboards. Spotify API generates fake event data emitted to Kafka. Spark consu…☆67Updated last year
- ☆190Updated last year
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆21Updated last year
- ☆263Updated 5 months ago
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆24Updated last year
- Code for "Efficient Data Processing in Spark" Course☆272Updated 4 months ago
- tokyo-olympic-azure-data-engineering-project☆179Updated 6 months ago
- This repository will contain all of the resources for the Mage component of the Data Engineering Zoomcamp: https://github.com/DataTalksCl…☆98Updated 5 months ago
- sql-for-data-engineering-course☆19Updated last year
- apache-spark-with-databricks-for-data-engineering☆66Updated 6 months ago