jackmleitch / StravaDataPipline
EtLT of my own Strava data using the Strava API, MySQL, Python, S3, Redshift, and Airflow
☆31Updated 2 years ago
Alternatives and similar repositories for StravaDataPipline:
Users that are interested in StravaDataPipline are comparing it to the libraries listed below
- Data engineering project using UK Bus Open Data Service (BODS) to calculate late buses in real-time for any selected region in England. P…☆28Updated 2 years ago
- Predicting Strava Kudos on my own activities using the given activity's attributes.☆12Updated 2 years ago
- Pipeline that extracts data from the Spotify API to build a more detailed version of Spotify Wrapped☆34Updated last year
- Processing TfL data for bike usage with Google Cloud Platform.☆45Updated 2 years ago
- A Data Engineering project. Repository for backend infrastructure and Streamlit app files for a Premier League Dashboard.☆232Updated 10 months ago
- ☆33Updated last year
- End to end data engineering project☆54Updated 2 years ago
- Data pipeline that scrapes Rust cheater Steam profiles☆52Updated 3 years ago
- Tracking and measuring neighborhood and district-level eviction rates in the city of San Francisco.☆139Updated 4 years ago
- Some example projects for Data Engineers to build, end-to-end.☆28Updated last year
- Near real time ETL to populate a dashboard.☆73Updated 9 months ago
- ☆17Updated 2 years ago
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆260Updated 9 months ago
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 8 months ago
- Template for Data Engineering and Data Pipeline projects☆109Updated 2 years ago
- ☆144Updated last year
- A "modern" Strava data pipeline fueled by dlt, duckdb, dbt, and evidence.dev☆31Updated 2 months ago
- End-to-end data engineer project☆18Updated last year
- This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as staging…☆84Updated 5 years ago
- A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces.☆66Updated last year
- A tutorial for the Great Expectations library.☆70Updated 4 years ago
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/☆35Updated 11 months ago
- Repo for saving cheat sheets☆48Updated 10 months ago
- Data Pipeline from the Global Historical Climatology Network DataSet☆27Updated 2 years ago
- Code for "Advanced data transformations in SQL" free live workshop☆75Updated 5 months ago
- A real-time reddit data streaming pipeline for sentiment analysis of various subreddits☆123Updated last year
- Code for dbt tutorial☆155Updated 10 months ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆73Updated 10 months ago
- ☆106Updated 2 years ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆142Updated 4 years ago