Data Forge — a modern data stack playground to practice flows and best practices, not just tools. Spark, Trino, Kafka, Iceberg, ClickHouse, Airflow, MinIO, Superset — all wired together locally with Docker Compose.
☆171Oct 11, 2025Updated 5 months ago
Alternatives and similar repositories for data-forge
Users that are interested in data-forge are comparing it to the libraries listed below.
- Distributed run of dbt models using Airflow☆168Feb 11, 2026Updated last month
- Data Engineering Digest☆29Jun 24, 2024Updated last year
- DE or DIE meetup made by data engineers for data engineers. Currently in Russian only.☆58Jan 6, 2024Updated 2 years ago
- Using Debezium for streaming data processing: core concepts and examples.☆18Apr 12, 2025Updated 11 months ago
- Getting Started with Data Engineering☆1,319Apr 20, 2025Updated 11 months ago
- Trino Group Provider LDAP is a Trino (formerly Presto SQL) plugin to map user names to groups using an LDAP server☆22Mar 27, 2024Updated 2 years ago
- This repository is no longer maintained.☆15Mar 10, 2022Updated 4 years ago
- Spark in Kubernetes☆39Jun 3, 2024Updated last year
- DWH powered by Clickhouse and dbt☆13Aug 4, 2024Updated last year
- ☆150Mar 3, 2026Updated 3 weeks ago
- This code is used to populate the "ODS jobs dump" Telegram bot, and it can be used for any other dumped Slack channel☆14Sep 12, 2022Updated 3 years ago
- Python course☆39Mar 23, 2026Updated last week
- Module for pipelines concept in PySpark☆16Mar 27, 2024Updated 2 years ago
- CraftML is a restful web service for easy pipeline creation without code.☆13Apr 18, 2021Updated 4 years ago
- Art Project For Firefox displays masterpieces from Google Cultural Institute in your Firefox's new tabs☆14Apr 18, 2015Updated 10 years ago
- Surfalytics projects on Data Engineering and Analytics☆120Feb 27, 2026Updated last month
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆19Apr 25, 2024Updated last year
- ☆17May 22, 2023Updated 2 years ago
- This project is used to capture machine learning pipelines created on top of Spark as OK☆54Nov 1, 2022Updated 3 years ago
- Materials for the course Introduction to Data Engineering: data pipelines☆10Feb 18, 2024Updated 2 years ago
- A Procedure To Create A Yarn Cluster Based on Docker, Run Spark, And Do TPC-DS Performance Test.☆16Jan 3, 2024Updated 2 years ago
- Analytics Engineer Course☆20May 17, 2023Updated 2 years ago
- Explore the dbt Semantic Layer☆31May 26, 2025Updated 10 months ago
- DTF.ru game info bot by RAWG.io☆17Jul 25, 2023Updated 2 years ago
- A table-type dbt materialization for Snowflake to enable Time Travel