projectnessie / nessie
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
☆1,159Updated this week
Alternatives and similar repositories for nessie:
Users that are interested in nessie are comparing it to the libraries listed below
- Apache PyIceberg☆652Updated this week
- An Open Standard for lineage metadata collection☆1,879Updated this week
- Apache Polaris, the interoperable, open source catalog for Apache Iceberg☆1,384Updated this week
- Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.☆498Updated this week
- An open protocol for secure data sharing☆817Updated this week
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆828Updated last month
- Apache DataFusion Comet Spark Accelerator☆921Updated this week
- A highly efficient daemon for streaming data from Kafka into Delta Lake☆393Updated 2 weeks ago
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,274Updated this week
- Dremio - the missing link in modern data☆1,418Updated 4 months ago
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,297Updated this week
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)☆230Updated 3 months ago
- Collect, aggregate, and visualize a data ecosystem's metadata☆1,879Updated last week
- Open Control Plane for Tables in Data Lakehouse☆331Updated last week
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆237Updated last week
- ☆188Updated last week
- Python client for Trino☆355Updated last week
- A load balancer / proxy / gateway for prestodb☆357Updated 7 months ago
- ☆260Updated 5 months ago
- dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks☆422Updated last month
- Open, Multi-modal Catalog for Data & AI☆2,735Updated this week
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆736Updated last month
- Data Lineage Tracking And Visualization Solution☆615Updated this week
- Apache DataFusion Ballista Distributed Query Engine☆1,689Updated this week
- Egeria core☆833Updated this week
- Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.☆1,429Updated this week
- ☆238Updated this week
- dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)☆1,022Updated last week
- Generate and Visualize Data Lineage from query history☆322Updated last year
- Efficient data transformation and modeling framework that is backwards compatible with dbt.☆2,180Updated this week