projectnessie / nessie
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
☆984Updated this week
Related projects: ⓘ
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆780Updated 2 weeks ago
- An Open Standard for lineage metadata collection☆1,708Updated this week
- Apache PyIceberg☆385Updated this week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,167Updated this week
- Apache DataFusion Comet Spark Accelerator☆745Updated last week
- The interoperable, open source catalog for Apache Iceberg☆1,012Updated this week
- An open protocol for secure data sharing☆747Updated 3 weeks ago
- ☆375Updated this week
- Dremio - the missing link in modern data☆1,356Updated 2 weeks ago
- A highly efficient daemon for streaming data from Kafka into Delta Lake☆354Updated last week
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,143Updated this week
- A load balancer / proxy / gateway for prestodb☆356Updated last month
- Collect, aggregate, and visualize a data ecosystem's metadata☆1,732Updated last week
- Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.☆1,126Updated this week
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆341Updated 3 months ago
- ☆197Updated last month
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)☆207Updated last week
- dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks☆390Updated this week
- ☆232Updated this week
- Open Control Plane for Tables in Data Lakehouse☆289Updated this week
- Data Lineage Tracking And Visualization Solution☆596Updated last week
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆692Updated last month
- Python client for Trino☆322Updated 2 weeks ago
- ☆248Updated last week
- Egeria core☆796Updated this week
- A simplified, lightweight ETL Framework based on Apache Spark☆581Updated 7 months ago
- Replicates any database (CDC events) to Apache Iceberg (To Cloud Storage)☆179Updated this week
- ☆144Updated this week
- 📙 Awesome Data Catalogs and Observability Platforms.☆677Updated last month
- Apache DataFusion Ballista Distributed Query Engine☆1,468Updated this week