a curated list of awesome lakehouse frameworks, applications, etc
☆46Mar 9, 2026Updated 3 months ago
Alternatives and similar repositories for awesome-lakehouse
Users that are interested in awesome-lakehouse are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Monitoring and insights on your data lakehouse tables☆32Updated this week
- ☆11Jun 8, 2026Updated last week
- ☆30Dec 4, 2024Updated last year
- ☆13Jun 10, 2024Updated 2 years ago
- The Data Landing Zone is a CDK Construct designed to create a landing zone tailored for supporting and enabling AI, data-driven, data mes…☆23Updated this week
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Apache Hive Metastore in Standalone Mode With Docker☆14Jul 22, 2024Updated last year
- Olympia is a storage-only open catalog format for big data analytics, ML & AI.☆16May 5, 2025Updated last year
- ☆13Oct 12, 2024Updated last year
- A leightweight UI for Lakekeeper☆17Updated this week
- Open Control Plane for Tables in Data Lakehouse☆391Updated this week
- Iceberg Playground in a Box☆69Apr 8, 2026Updated 2 months ago
- Start debugger listener on a running Node.js process☆12Oct 24, 2019Updated 6 years ago
- ☆20Jun 16, 2020Updated 6 years ago
- A Table format agnostic data sharing framework☆41Feb 4, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Altinity Datasets for ClickHouse☆19Feb 20, 2025Updated last year
- Sord Data Fabric: A Vue 3 frontend with a Python WebSocket server, leveraging a distributed architecture with DeltaLake and DuckDB worker…☆18Dec 9, 2023Updated 2 years ago
- Fast, zero-copy HTML Parser written in Rust☆30Dec 6, 2025Updated 6 months ago
- The home of Floecat: A catalog of catalogs for open table formats☆84Updated this week
- A playground to experience Gravitino☆79May 15, 2026Updated last month
- A cli for spinning up and managing Ray clusters for the Daft Query Engine.☆15Feb 15, 2025Updated last year
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆85Apr 12, 2025Updated last year
- Open, Multi-modal Catalog for Data & AI, written in Rust☆84Sep 30, 2024Updated last year
- Batteries included CLI, TUI, and server implementations for DataFusion.☆198Apr 14, 2026Updated 2 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- The observability platform for Iceberg lakehouses.☆462Jan 12, 2026Updated 5 months ago
- A complete data engineering project demonstrating modern data stack practices with Apache Flink, Iceberg, Trino and Superset☆26Sep 29, 2025Updated 8 months ago
- Python Package for ducklake☆20Jun 5, 2025Updated last year
- ☆53Jun 9, 2026Updated last week
- Point-in-Time optimizations for Apache Spark☆30Jan 18, 2024Updated 2 years ago
- A DSL for scalacOptions☆17Updated this week
- Computer science fundamentals.☆21Aug 18, 2025Updated 10 months ago
- A cloud native data mesh implementation☆12Jan 15, 2021Updated 5 years ago
- The Go library for pulsar admin operations, providing a unified Go API for managing pulsar resources such as tenants, namespaces and top…☆14Aug 23, 2023Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- My resume (or yours) as a TUI application☆39Aug 29, 2025Updated 9 months ago
- Command line debugging console for Cats Effect☆19Apr 2, 2024Updated 2 years ago
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR☆39Feb 17, 2025Updated last year
- DuckDB Pyroscope Extension for Continuous Profiling☆21Feb 18, 2026Updated 4 months ago
- ☆23Sep 7, 2023Updated 2 years ago
- MCP server for Apache Iceberg☆34Nov 17, 2025Updated 7 months ago
- collection of read materials☆18May 18, 2020Updated 6 years ago