A Table format agnostic data sharing framework
☆42Feb 4, 2024Updated 2 years ago
Alternatives and similar repositories for lakehouse-sharing
Users that are interested in lakehouse-sharing are comparing it to the libraries listed below
Sorting:
- ☆30Dec 4, 2024Updated last year
- A Minimalistic Rust Implementation of Delta Sharing Server.☆98Mar 17, 2025Updated 11 months ago
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol.☆21Jun 12, 2024Updated last year
- A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL☆46Dec 14, 2025Updated 2 months ago
- ☆13Oct 4, 2023Updated 2 years ago
- A leightweight UI for Lakekeeper☆16Mar 2, 2026Updated last week
- Delta Lake examples☆240Oct 8, 2024Updated last year
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Updated this week
- A cli for spinning up and managing Ray clusters for the Daft Query Engine.☆15Feb 15, 2025Updated last year
- MLflow-tracking server example with Minio and H2O☆18Oct 25, 2019Updated 6 years ago
- Fybrik platform - Arrow/Flight module☆15Aug 10, 2024Updated last year
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆37Mar 9, 2021Updated 5 years ago
- ☆21Aug 26, 2025Updated 6 months ago
- A write-audit-publish implementation on a data lake without the JVM☆45Aug 12, 2024Updated last year
- Open, Multi-modal Catalog for Data & AI, written in Rust☆86Sep 30, 2024Updated last year
- Yet Another (Spark) ETL Framework☆21Oct 21, 2023Updated 2 years ago
- Apache Hive Metastore as a Standalone server in Docker☆80Aug 22, 2024Updated last year
- Glue JupyterLab Extension☆20Mar 2, 2026Updated last week
- ☆23May 2, 2024Updated last year
- Apache DataFusion Ray☆228Oct 5, 2025Updated 5 months ago
- An open protocol for secure data sharing☆920Updated this week
- A JupyterLab extension providing, SQL formatter, auto-completion, syntax highlighting, Spark SQL and Trino☆95Feb 28, 2026Updated last week
- ☆17Nov 25, 2024Updated last year
- ☆30Feb 25, 2025Updated last year
- A tool to generate PySpark schema from JSON.☆28Jan 21, 2024Updated 2 years ago
- Lance Namespace is an open specification for describing access and operations against a collection of tables in a multimodal lakehouse☆51Feb 23, 2026Updated 2 weeks ago
- ☆37Apr 9, 2025Updated 11 months ago
- Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary!☆235Jan 24, 2025Updated last year
- 🤖 A GitHub action that leverages fabric patterns through an agent-based approach☆34Jan 4, 2025Updated last year
- ☆14Feb 15, 2025Updated last year
- Spark-Radiant is Apache Spark Performance and Cost Optimizer☆25Dec 31, 2024Updated last year
- Generating Federated GraphQL API's from Datasources with Apache Calcite☆37Feb 21, 2022Updated 4 years ago
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆83Apr 12, 2025Updated 10 months ago
- DataFusion TableProviders for reading data from other systems☆172Updated this week
- The full set of microservices for Fleetman without needing a Euerka Registry.☆11Aug 1, 2024Updated last year
- Apache Polaris Tools, additional tooling for Apache Polaris☆25Updated this week
- Utility functions to support analytics over FHIR in BigQuery or Apache Spark☆15Jan 8, 2024Updated 2 years ago
- a curated list of awesome lakehouse frameworks, applications, etc☆42Feb 9, 2026Updated last month
- OPI5 open micro desk design.☆13Mar 6, 2023Updated 3 years ago