ShawhinT / data-pipeline-example
Example data pipeline automation with GitHub Actions
☆14Updated last week
Alternatives and similar repositories for data-pipeline-example:
Users that are interested in data-pipeline-example are comparing it to the libraries listed below
- Demo on how to use Prefect with Docker☆25Updated 2 years ago
- an end-to-end data pipeline extracting music listening habits and producing an insightful dashboard☆14Updated 9 months ago
- build dw with dbt☆33Updated 2 months ago
- Analyzing Video Assistant Referee (VAR) decisions in the English Premier League (2019 - 2021)☆12Updated 3 years ago
- Code for my "Efficient Data Processing in SQL" book.☆54Updated 5 months ago
- Use Multiple Linear Regression, Python, Pandas, and Matplotlib to analyze the lifetime value and the key factors of the ‘Telco Customer C…☆10Updated 4 years ago
- A pipeline to detect data drift and retrain the model when there is drift☆23Updated last year
- Data pipeline project using Data Factory, Databricks and Cosmosdb Graph, deployed using Azure DevOps, secured using firewalls and Azure A…☆11Updated 2 years ago
- A simple app to classify dogs using fastai and streamlit.☆17Updated 3 years ago
- Create a local dashboard to visualize and filter your GitHub feed☆29Updated 2 years ago
- A demo of the Mito Streamlit Spreadsheet☆17Updated last year
- Check the basic quality of any dataset☆11Updated 3 years ago
- Data engineering project using UK Bus Open Data Service (BODS) to calculate late buses in real-time for any selected region in England. P…☆28Updated last year
- Repository for Data Engineering Zoomcamp 2024☆13Updated 9 months ago
- Demonstrating the efficiency of pmdarima’s auto_arima() function compared to implementing a traditional ARIMA model.☆9Updated 3 years ago
- A repository containing data and files for my stories on Medium.com.☆54Updated last month
- ☆38Updated 6 months ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆61Updated 7 months ago
- Daily US Stock Market summary with focus on price action & statistics☆39Updated last month
- A Postgres data warehouse for processing synthetic data using IAC principles☆15Updated last year
- ☆43Updated 2 years ago
- I am using confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti…☆29Updated last year
- Data Engineer Roadmaps as Projects Funnel☆11Updated 2 years ago
- Data pipeline that scrapes Rust cheater Steam profiles☆51Updated 2 years ago
- ☆9Updated last year
- Cost Efficient Data Pipelines with DuckDB☆48Updated 5 months ago
- Resources and notebooks to accompany the Duplicate Detection using GenAI paper☆11Updated 6 months ago
- ☆15Updated 11 months ago
- Faster pandas☆35Updated last year
- Intro to Polars Tutorial☆21Updated last year