☆123Jul 24, 2025Updated 8 months ago
Alternatives and similar repositories for microbatch-hourly-deduped-tutorial
Users that are interested in microbatch-hourly-deduped-tutorial are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository goes over how to handle massive variety in data engineering☆317Jan 16, 2023Updated 3 years ago
- This repository helps teach people how to correctly define and create cumulative tables!☆757Oct 29, 2024Updated last year
- Hey this is the repo that has all the queries and data for my video game training series!☆160Jun 5, 2022Updated 3 years ago
- Full codebase for rootski.io (without the data)☆29Sep 17, 2022Updated 3 years ago
- csv and flat-file sniffer built in Rust.☆45Jan 26, 2024Updated 2 years ago
- Data Vault Modeling☆15May 25, 2025Updated 9 months ago
- Example FastAPI app deployed to AWS with CDK.☆16Feb 23, 2023Updated 3 years ago
- ☆13Jul 1, 2025Updated 8 months ago
- ☆14Dec 28, 2023Updated 2 years ago
- This is a public repository to go over all the LLM-driven data engineering concepts.☆1,133Oct 26, 2024Updated last year
- Code for my "Efficient Data Processing in SQL" book.☆61Aug 6, 2024Updated last year
- This Power BI project provides insights into customer orders and product tracking using interactive dashboards. It visualizes order statu…☆10Aug 15, 2025Updated 7 months ago
- Streaming analytics project with eventsim and Kafka☆13Dec 23, 2022Updated 3 years ago
- Capstone Project for DataExpert.io V4 Cohort☆13Jul 8, 2024Updated last year
- fst: flow state tool | smooth where you want it, friction where you need it when data engineering☆33Jun 13, 2023Updated 2 years ago
- Sample project to demonstrate data engineering best practices☆211Feb 24, 2024Updated 2 years ago
- A bentoML-powered API to transcribe audio and make sense of it☆39Dec 21, 2022Updated 3 years ago
- ☆23May 16, 2023Updated 2 years ago
- This repo has all the resources you need to become an amazing analytics engineer!☆319Feb 17, 2026Updated last month
- Time Series Analysis and Its Applications, Ed 5☆20Mar 14, 2026Updated last week
- ☆11Dec 14, 2019Updated 6 years ago
- ☆23Oct 6, 2025Updated 5 months ago
- Examples surrounding Databricks.☆60Jul 4, 2024Updated last year
- Spark development environment for kubernetes, spark-submit and jupyter notebook☆19Nov 30, 2021Updated 4 years ago
- Python Data Mining Cookbook by Packt☆11Jan 14, 2021Updated 5 years ago
- CS231N project☆12Dec 17, 2018Updated 7 years ago
- ☆110Jan 23, 2025Updated last year
- A data engineering personal project for applying some of my skills☆19Jul 11, 2021Updated 4 years ago
- This repository hosts materials for the Docker for Data Engineers workshop, offering hands-on exercises and resources tailored for data e…☆17May 23, 2024Updated last year
- ☆16Jan 26, 2025Updated last year
- This extension makes vscode seamlessly work with dbt and bigquery☆15Sep 27, 2022Updated 3 years ago
- A demo instance of mage for pulling sample data from a public Google pub/sub topic and transforming with dbt.☆12Jan 5, 2024Updated 2 years ago
- ☆18Dec 2, 2024Updated last year
- Prefect integrations for working with OpenAI.☆34Updated this week
- ☆10Oct 24, 2023Updated 2 years ago
- This is a repo with links to everything you'd ever want to learn about data engineering☆40,628Updated this week
- Simple type converters: make ints, floats, bools and dates from your strings!☆11Jul 23, 2016Updated 9 years ago
- Twitter auto account report bot using selenium with python☆13Apr 19, 2024Updated last year
- Using Selenium and Beautiful Soup to scrape marathon images☆10Feb 21, 2019Updated 7 years ago