jackmleitch / StravaDataPiplineLinks
  EtLT of my own Strava data using the Strava API, MySQL, Python, S3, Redshift, and Airflow
☆32Updated 3 years ago
Alternatives and similar repositories for StravaDataPipline
Users that are interested in StravaDataPipline are comparing it to the libraries listed below
Sorting:
- Processing TfL data for bike usage with Google Cloud Platform.☆46Updated 3 years ago
 - Data engineering project using UK Bus Open Data Service (BODS) to calculate late buses in real-time for any selected region in England. P…☆30Updated 2 years ago
 - A Data Engineering project. Repository for backend infrastructure and Streamlit app files for a Premier League Dashboard.☆248Updated last year
 - Pipeline that extracts data from the Spotify API to build a more detailed version of Spotify Wrapped☆47Updated last week
 - Some example projects for Data Engineers to build, end-to-end.☆34Updated last year
 - Template for Data Engineering and Data Pipeline projects☆114Updated 2 years ago
 - Code for blog at: https://www.startdataengineering.com/post/docker-for-de/☆40Updated last year
 - Code for dbt tutorial☆162Updated last month
 - Predicting Strava Kudos on my own activities using the given activity's attributes.☆14Updated 3 years ago
 - Code for blog at https://www.startdataengineering.com/post/python-for-de/☆88Updated last year
 - An end-to-end data pipeline which extracts divvy bikeshare data from web loads it into data lake and datawarehouse transforms it using db…☆37Updated 2 years ago
 - Sample project to demonstrate data engineering best practices☆197Updated last year
 - End to end data engineering project☆57Updated 3 years ago
 - A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces.☆78Updated 2 years ago
 - Code for my "Efficient Data Processing in SQL" book.☆59Updated last year
 - Data pipeline that scrapes Rust cheater Steam profiles☆54Updated 3 years ago
 - Data Pipeline from the Global Historical Climatology Network DataSet☆27Updated 2 years ago
 - A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆279Updated last year
 - This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as staging…☆93Updated 6 years ago
 - Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆157Updated 5 years ago
 - A Covid-19 data pipeline on AWS featuring PySpark/Glue, Docker, Great Expectations, Airflow, and Redshift, templated in CloudFormation an…☆23Updated last year
 - ☆161Updated 3 years ago
 - Project for "Data pipeline design patterns" blog.☆46Updated last year
 - ☆35Updated 2 years ago
 - Repository for Data Engineering Interview Series☆33Updated last year
 - Code for "Advanced data transformations in SQL" free live workshop☆85Updated 5 months ago
 - Price Crawler - Tracking Price Inflation☆188Updated 5 years ago
 - A tutorial for the Great Expectations library.☆73Updated 4 years ago
 - ☆145Updated last year
 - end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidence☆226Updated 3 weeks ago