delta-io/delta-sharing

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/delta-io/delta-sharing)

delta-io / delta-sharing

An open protocol for secure data sharing

☆952

Alternatives and similar repositories for delta-sharing

Users that are interested in delta-sharing are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

delta-io / delta
View on GitHub
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…
☆8,925Updated this week
delta-io / delta-rs
View on GitHub
A native Rust library for Delta Lake, with bindings into Python
☆3,274Updated this week
unitycatalog / unitycatalog
View on GitHub
Open, Multi-modal Catalog for Data & AI
☆3,469Updated this week
delta-io / kafka-delta-ingest
View on GitHub
A highly efficient daemon for streaming data from Kafka into Delta Lake
☆439Jun 22, 2026Updated last month
delta-incubator / delta-sharing-rs
View on GitHub
A Minimalistic Rust Implementation of Delta Sharing Server.
☆97Mar 17, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
rajagurunath / lakehouse-sharing
View on GitHub
A Table format agnostic data sharing framework
☆42Feb 4, 2024Updated 2 years ago
projectnessie / nessie
View on GitHub
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
☆1,483Updated this week
awslabs / deequ
View on GitHub
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
☆3,638Updated this week
delta-io / delta-kernel-rs
View on GitHub
A native Delta implementation for integration with any query engine
☆351Updated this week
databrickslabs / arcuate
View on GitHub
Delta Sharing + MLflow for ML model & experiment exchange (arcuate delta - a fan shaped river delta)
☆22Jan 29, 2026Updated 5 months ago
OpenLineage / OpenLineage
View on GitHub
An Open Standard for lineage metadata collection
☆2,562Updated this week
databricks / koalas
View on GitHub
Koalas: pandas API on Apache Spark
☆3,372Mar 20, 2024Updated 2 years ago
datamechanics / delight
View on GitHub
A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.
☆345May 31, 2024Updated 2 years ago
unitycatalog / unitycatalog-rs
View on GitHub
Open, Multi-modal Catalog for Data & AI, written in Rust
☆86Sep 30, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
apache / iceberg
View on GitHub
Apache Iceberg
☆9,081Updated this week
databricks / databricks-cli
View on GitHub
(Legacy) Command Line Interface for Databricks
☆397Oct 5, 2023Updated 2 years ago
delta-io / delta-examples
View on GitHub
Delta Lake examples
☆241Oct 8, 2024Updated last year
sodadata / soda-core
View on GitHub
Data Contracts engine for the modern data stack. https://www.soda.io
☆2,397Updated this week
trinodb / trino
View on GitHub
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
☆13,076Updated this week
pyspark-ai / pyspark-ai
View on GitHub
English SDK for Apache Spark
☆876Jun 12, 2024Updated 2 years ago
microsoft / hyperspace
View on GitHub
An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
☆430Jan 14, 2022Updated 4 years ago
amundsen-io / amundsen
View on GitHub
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting…
☆4,780Jul 1, 2026Updated 3 weeks ago
fugue-project / fugue
View on GitHub
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…
☆2,170May 19, 2026Updated 2 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
G-Research / spark-extension
View on GitHub
A library that provides useful extensions to Apache Spark and PySpark.
☆239Jul 1, 2026Updated 3 weeks ago
fivetran / great_expectations
View on GitHub
Always know what to expect from your data.
☆11,675Updated this week
apache / hudi
View on GitHub
Upserts, Deletes And Incremental Processing on Big Data.
☆6,197Updated this week
delta-incubator / deltaray
View on GitHub
Delta reader for the Ray open-source toolkit for building ML applications
☆46Jan 27, 2024Updated 2 years ago
awslabs / python-deequ
View on GitHub
Python API for Deequ
☆824Updated this week
treeverse / lakeFS
View on GitHub
lakeFS - Data version control for your data lake | Git for data
☆5,475Updated this week
databrickslabs / ucx
View on GitHub
Automated migrations to Unity Catalog
☆308Jun 12, 2026Updated last month
datahub-project / datahub
View on GitHub
The Context Platform for your Data and AI Stack
☆12,356Updated this week
databrickslabs / dbx
View on GitHub
🧱 Databricks CLI eXtensions - aka dbx is a CLI tool for development and advanced Databricks workflows management.
☆463Mar 27, 2026Updated 4 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
databricks / databricks-sdk-py
View on GitHub
Databricks SDK for Python (Beta)
☆560Updated this week
databrickslabs / overwatch
View on GitHub
THIS PROJECT IS DEPRECATED. Capture deep metrics on one or all assets within a Databricks workspace
☆230Jan 8, 2026Updated 6 months ago
apache / datafusion-comet
View on GitHub
Apache DataFusion Comet Spark Accelerator
☆1,234Updated this week
apache / polaris
View on GitHub
Apache Polaris, the interoperable, open source catalog for Apache Iceberg
☆2,022Updated this week
databricks / terraform-provider-databricks
View on GitHub
Databricks Terraform Provider
☆597Updated this week
apache / incubator-xtable
View on GitHub
Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processin…
☆1,197Updated this week
NVIDIA / cudf-spark
View on GitHub
NVIDIA cuDF for Apache Spark plugin - accelerate Apache Spark with GPUs
☆991Updated this week