☆124Jul 24, 2025Updated 10 months ago
Alternatives and similar repositories for microbatch-hourly-deduped-tutorial
Users that are interested in microbatch-hourly-deduped-tutorial are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository goes over how to handle massive variety in data engineering☆324Jan 16, 2023Updated 3 years ago
- This repository helps teach people how to correctly define and create cumulative tables!☆765Oct 29, 2024Updated last year
- Hey this is the repo that has all the queries and data for my video game training series!☆160Jun 5, 2022Updated 4 years ago
- csv and flat-file sniffer built in Rust.☆45Jan 26, 2024Updated 2 years ago
- Data Vault Modeling☆15May 25, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Example FastAPI app deployed to AWS with CDK.☆16Feb 23, 2023Updated 3 years ago
- ☆13Dec 28, 2023Updated 2 years ago
- This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H…☆39Oct 14, 2025Updated 8 months ago
- This Power BI project provides insights into customer orders and product tracking using interactive dashboards. It visualizes order statu…☆10Aug 15, 2025Updated 10 months ago
- Code for my "Efficient Data Processing in SQL" book.☆62Aug 6, 2024Updated last year
- Streaming analytics project with eventsim and Kafka☆13Dec 23, 2022Updated 3 years ago
- Capstone Project for DataExpert.io V4 Cohort☆13Jul 8, 2024Updated last year
- A python web scrap and data analytics project used to identify key metrics and BI insights about Brazilian Real Estate Investment Fund (a…☆14Aug 26, 2020Updated 5 years ago
- ☆17May 23, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- fst: flow state tool | smooth where you want it, friction where you need it when data engineering☆33Jun 13, 2023Updated 3 years ago
- This is the starter code for both the course and the project for Data Streaming with Spark☆17Jul 21, 2022Updated 3 years ago
- Sample project to demonstrate data engineering best practices☆220Feb 24, 2024Updated 2 years ago
- ☆18Jan 3, 2024Updated 2 years ago
- ☆116Jan 15, 2025Updated last year
- ☆23May 16, 2023Updated 3 years ago
- ☆160Feb 25, 2026Updated 3 months ago
- This repo has all the resources you need to become an amazing analytics engineer!☆351Feb 17, 2026Updated 3 months ago
- ☆11Dec 14, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Spark development environment for kubernetes, spark-submit and jupyter notebook☆18Nov 30, 2021Updated 4 years ago
- 65 Articles on SQL: A Comprehensive Guide to Mastering Advanced SQL☆11Jun 7, 2023Updated 3 years ago
- Python Data Mining Cookbook by Packt☆11Jan 14, 2021Updated 5 years ago
- FIWARE 305: Real-time Processing of Context Data using Apache Flink☆11May 15, 2026Updated last month
- ☆113Jan 23, 2025Updated last year
- This repository hosts materials for the Docker for Data Engineers workshop, offering hands-on exercises and resources tailored for data e…☆17May 23, 2024Updated 2 years ago
- This extension makes vscode seamlessly work with dbt and bigquery☆15Sep 27, 2022Updated 3 years ago
- ☆15Jan 26, 2025Updated last year
- Multipage Application built using Dash☆11Jan 24, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A reference implementation of an end to end, open-source MLOps platform.☆15Nov 20, 2022Updated 3 years ago
- ☆10Oct 24, 2023Updated 2 years ago
- This is a repo with links to everything you'd ever want to learn about data engineering☆41,674Apr 2, 2026Updated 2 months ago
- A build tool to turn markdown into an html presentation and then publish to gh-pages☆29May 19, 2026Updated 3 weeks ago
- Using Selenium and Beautiful Soup to scrape marathon images☆10Feb 21, 2019Updated 7 years ago
- Upload of all my presentations which I've been doing in the past☆10May 21, 2026Updated 3 weeks ago
- Airflow & DBT Cloud Integrated Project Presented at Lagos DBT Community Meetup & DataFestAfrica 23☆13Oct 11, 2023Updated 2 years ago