jackmleitch / StravaDataPipline
EtLT of my own Strava data using the Strava API, MySQL, Python, S3, Redshift, and Airflow
☆32 Updated 2 years ago
Alternatives and similar repositories for StravaDataPipline
Users that are interested in StravaDataPipline are comparing it to the libraries listed below
- Data engineering project using UK Bus Open Data Service (BODS) to calculate late buses in real-time for any selected region in England. P… ☆30 Updated 2 years ago
- A Data Engineering project. Repository for backend infrastructure and Streamlit app files for a Premier League Dashboard. ☆237 Updated last year
- Some example projects for Data Engineers to build, end-to-end. ☆30 Updated last year
- Pipeline that extracts data from the Spotify API to build a more detailed version of Spotify Wrapped ☆35 Updated last year
- Predicting Strava Kudos on my own activities using the given activity's attributes. ☆12 Updated 2 years ago
- Processing TfL data for bike usage with Google Cloud Platform. ☆45 Updated 2 years ago
- Code for my "Efficient Data Processing in SQL" book. ☆56 Updated 10 months ago
- Example repo to create end to end tests for data pipeline. ☆24 Updated 11 months ago
- I will attempt to create my own Spotify Wrapped by collecting data from the Spotify API, perform transformations and create informative d… ☆74 Updated 2 years ago
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing ☆262 Updated 10 months ago
- Step-by-step tutorial on building a Kimball dimensional model with dbt ☆140 Updated 10 months ago
- Near real time ETL to populate a dashboard. ☆72 Updated 11 months ago
- A tutorial for the Great Expectations library. ☆71 Updated 4 years ago
- ☆34 Updated 2 years ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow ☆146 Updated 4 years ago
- Tracking and measuring neighborhood and district-level eviction rates in the city of San Francisco. ☆139 Updated 4 years ago
- Project for "Data pipeline design patterns" blog. ☆45 Updated 10 months ago
- Code for dbt tutorial ☆157 Updated last year
- This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as staging… ☆90 Updated 5 years ago
- Simple stream processing pipeline ☆103 Updated 11 months ago
- Sample project to demonstrate data engineering best practices ☆191 Updated last year
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa… ☆230 Updated 2 years ago
- A modern ELT demo using Airbyte, dbt, Snowflake and Dagster ☆28 Updated 2 years ago
- ☆109 Updated 3 years ago
- Data pipeline that scrapes Rust cheater Steam profiles ☆51 Updated 3 years ago
- Streaming eight subreddits from the Reddit API using a Kafka producer & Spark Structured Streaming. ☆19 Updated 2 months ago
- A "modern" Strava data pipeline fueled by dlt, DuckDB, dbt, and evidence.dev ☆33 Updated 3 weeks ago
- End to end data engineering project ☆56 Updated 2 years ago
- A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces. ☆69 Updated last year
- A demonstration of an ELT (Extract, Load, Transform) pipeline ☆29 Updated last year