linkedin/openhouse

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/linkedin/openhouse)

linkedin / openhouse

Open Control Plane for Tables in Data Lakehouse

☆392

Alternatives and similar repositories for openhouse

Users that are interested in openhouse are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

apache / polaris
View on GitHub
Apache Polaris, the interoperable, open source catalog for Apache Iceberg
☆2,018Updated this week
lakekeeper / lakekeeper
View on GitHub
Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.
☆1,392Updated this week
linkedin / coral
View on GitHub
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
☆907Updated this week
apache / incubator-xtable
View on GitHub
Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processin…
☆1,194Updated this week
nimtable / nimtable
View on GitHub
The observability platform for Iceberg lakehouses.
☆468Jan 12, 2026Updated 6 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
projectnessie / nessie
View on GitHub
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
☆1,481Updated this week
apache / datafusion-comet
View on GitHub
Apache DataFusion Comet Spark Accelerator
☆1,230Updated this week
databricks / iceberg-kafka-connect
View on GitHub
☆285Jul 3, 2025Updated last year
GlareDB / glaredb
View on GitHub
GlareDB: A light and fast SQL database for analytics
☆1,017Nov 14, 2025Updated 8 months ago
unitycatalog / unitycatalog
View on GitHub
Open, Multi-modal Catalog for Data & AI
☆3,462Updated this week
memiiso / debezium-server-iceberg
View on GitHub
Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake
☆324Updated this week
ray-project / deltacat
View on GitHub
A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you…
☆282Apr 17, 2026Updated 3 months ago
eakmanrq / sqlframe
View on GitHub
Turning PySpark Into a Universal DataFrame API
☆526Updated this week
nimtable / iceberg-compaction
View on GitHub
Compaction runtime for Apache Iceberg.
☆131Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
apache / gravitino
View on GitHub
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
☆3,097Updated this week
Eventual-Inc / Daft
View on GitHub
High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale
☆5,644Updated this week
aws-samples / apache-xtable-on-aws-samples
View on GitHub
☆11Jun 8, 2026Updated last month
apache / iceberg
View on GitHub
Apache Iceberg
☆9,065Updated this week
awslabs / s3-tables-catalog
View on GitHub
The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…
☆154Jan 26, 2026Updated 5 months ago
linkedin / venice
View on GitHub
Venice, Derived Data Platform for Planet-Scale Workloads.
☆609Updated this week
apache / gluten
View on GitHub
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
☆1,576Updated this week
xorq-labs / xorq
View on GitHub
Executable memory system for tabular data that works in your harness.
☆537Updated this week
dlt-hub / dlt-dagster-demo
View on GitHub
dlt-dagster-demo
☆14Nov 6, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
linkedin / transport
View on GitHub
A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…
☆306Jun 29, 2026Updated 3 weeks ago
apache / iceberg-rust
View on GitHub
Apache Iceberg
☆1,348Updated this week
databricks / docker-spark-iceberg
View on GitHub
☆384Feb 15, 2026Updated 5 months ago
linkedin / Hoptimator
View on GitHub
Multi-hop declarative data pipelines
☆126Updated this week
onehouseinc / LakeView
View on GitHub
Monitoring and insights on your data lakehouse tables
☆32Jul 13, 2026Updated last week
apache / fluss
View on GitHub
Apache Fluss is a streaming storage built for real-time analytics.
☆1,991Updated this week
voltrondata / superset-sqlalchemy-adbc-flight-sql-poc
View on GitHub
A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver.
☆25Sep 8, 2023Updated 2 years ago
apache / amoro
View on GitHub
Apache Amoro(incubating) is a Lakehouse management system built on open data lake formats.
☆1,149Updated this week
SQLMesh / sqlmesh
View on GitHub
Scalable and efficient data transformation framework - backwards compatible with dbt.
☆3,208Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
facebookincubator / velox
View on GitHub
A composable and fully extensible C++ execution engine library for data management systems.
☆4,173Updated this week
duckdb / unity_catalog
View on GitHub
Proof-of-concept extension combining the delta extension with Unity Catalog
☆104Jul 12, 2026Updated last week
facebookincubator / nimble
View on GitHub
New and extensible file format for storage of large columnar datasets.
☆728Updated this week
apache / datafusion
View on GitHub
Apache DataFusion SQL Query Engine
☆9,000Updated this week
apache / iceberg-python
View on GitHub
PyIceberg
☆1,097Updated this week
databricks / iceberg-rest-image
View on GitHub
Simple project to expose a catalog over REST using a Java catalog backend
☆155Jan 19, 2025Updated last year
substrait-io / substrait
View on GitHub
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
☆1,535Updated this week