airscholar / FootballDataEngineeringLinks
An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow and saves it on Azure Data Lake. Other processing takes place on Azure Data Factory, Azure Synapse and Tableau.
☆31Updated 2 years ago
Alternatives and similar repositories for FootballDataEngineering
Users that are interested in FootballDataEngineering are comparing it to the libraries listed below
Sorting:
- apache-spark-with-databricks-for-data-engineering☆98Updated last year
- This repo contains all the code used in the Python for Data Engineering Course☆334Updated last year
- Data Engineering YouTube Analysis Project by Darshil Parmar☆226Updated 2 years ago
- datacamp Data Engineer with Python course. 73 hours/ 19 Courses /2 Skill Assessments☆139Updated 3 years ago
- tokyo-olympic-azure-data-engineering-project☆221Updated last year
- ☆316Updated last year
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆164Updated 3 years ago
- ☆163Updated 3 years ago
- sql-for-data-engineering-course☆18Updated 2 years ago
- YouTube tutorial project☆108Updated 2 years ago
- This is a template you can use for your next data engineering portfolio project.☆186Updated 4 years ago
- ☆22Updated 4 years ago
- This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science t…☆137Updated last year
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆203Updated 2 years ago
- Data Engineering Project with Hadoop HDFS and Kafka☆122Updated 2 years ago
- ☆212Updated 2 years ago
- Data Engineering with Python, published by Packt☆780Updated 3 years ago
- ☆383Updated last year
- I'm partaking in a Data Engineering Bootcamp / Zoomcamp. I'll store files and progress here.☆111Updated 3 years ago
- Azure Data Factory☆73Updated 6 months ago
- This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.☆220Updated last year
- ☆148Updated 3 years ago
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…☆245Updated 3 years ago
- Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more☆373Updated 2 years ago
- For the Coursera specialization https://www.coursera.org/specializations/gcp-data-machine-learning☆97Updated 8 years ago
- Transform data from on-premises SQL Server to Azure Delta Lake Storage for Analytics and Visualization☆18Updated 2 years ago
- Sample repo for startdataengineering DE 101 free course☆74Updated last year
- Personal Data Engineering Projects☆987Updated 3 years ago
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆312Updated 11 months ago
- Learn PySpark from Basics to Advanced. Checkout the YouTube Series : [PySpark - Zero to Hero]☆120Updated 5 months ago