airscholar / FootballDataEngineeringLinks
An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow and saves it on Azure Data Lake. Other processing takes place on Azure Data Factory, Azure Synapse and Tableau.
☆31Updated 2 years ago
Alternatives and similar repositories for FootballDataEngineering
Users that are interested in FootballDataEngineering are comparing it to the libraries listed below
Sorting:
- This repo contains all the code used in the Python for Data Engineering Course☆325Updated last year
- Data Engineering YouTube Analysis Project by Darshil Parmar☆213Updated 2 years ago
- datacamp Data Engineer with Python course. 73 hours/ 19 Courses /2 Skill Assessments☆133Updated 3 years ago
- Data Engineering Project with Hadoop HDFS and Kafka☆119Updated 2 years ago
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆181Updated 2 years ago
- This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science t…☆137Updated 11 months ago
- apache-spark-with-databricks-for-data-engineering☆95Updated last year
- ☆311Updated last year
- This is a template you can use for your next data engineering portfolio project.☆183Updated 4 years ago
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆163Updated 3 years ago
- ☆22Updated 4 years ago
- tokyo-olympic-azure-data-engineering-project☆217Updated last year
- The resources of the preparation course for Databricks Data Engineer Associate certification exam☆553Updated this week
- YouTube tutorial project☆105Updated 2 years ago
- For the Coursera specialization https://www.coursera.org/specializations/gcp-data-machine-learning☆97Updated 8 years ago
- This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.☆211Updated last year
- ☆381Updated 11 months ago
- I'm partaking in a Data Engineering Bootcamp / Zoomcamp. I'll store files and progress here.☆109Updated 3 years ago
- Azure Data Factory☆71Updated 5 months ago
- ☆162Updated 3 years ago
- Transform data from on-premises SQL Server to Azure Delta Lake Storage for Analytics and Visualization☆18Updated 2 years ago
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆25Updated 2 years ago
- Personal Data Engineering Projects☆973Updated 2 years ago
- sql-for-data-engineering-course☆18Updated 2 years ago
- Sample repo for startdataengineering DE 101 free course☆72Updated last year
- Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more☆370Updated 2 years ago
- Learn PySpark from Basics to Advanced. Checkout the YouTube Series : [PySpark - Zero to Hero]☆115Updated 3 months ago
- ☆146Updated 2 years ago
- A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!☆806Updated 3 years ago
- Data Engineering with Python, published by Packt☆771Updated 2 years ago