apache / polaris
Apache Polaris, the interoperable, open source catalog for Apache Iceberg
☆1,158Updated this week
Related projects ⓘ
Alternatives and complementary repositories for polaris
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,040Updated this week
- Apache PyIceberg☆473Updated this week
- An open protocol for secure data sharing☆770Updated last week
- An Open Standard for lineage metadata collection☆1,772Updated this week
- Apache DataFusion Comet Spark Accelerator☆821Updated this week
- dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks☆405Updated last week
- Efficient data transformation and modeling framework that is backwards compatible with dbt.☆1,824Updated this week
- dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)☆923Updated this week
- Open, Multi-modal Catalog for Data & AI☆2,432Updated this week
- Lakekeeper: A Rust native Iceberg REST Catalog☆234Updated this week
- Collect, aggregate, and visualize a data ecosystem's metadata☆1,781Updated last week
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)☆217Updated this week
- Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code☆760Updated this week
- Open Control Plane for Tables in Data Lakehouse☆312Updated this week
- A highly efficient daemon for streaming data from Kafka into Delta Lake☆369Updated last week
- Turning PySpark Into a Universal DataFrame API☆323Updated this week
- Home of the Open Data Contract Standard (ODCS).☆392Updated last week
- ☆252Updated 3 weeks ago
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,205Updated this week
- Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data acces…☆422Updated this week
- Python client for Trino☆335Updated this week
- 📙 Awesome Data Catalogs and Observability Platforms.☆727Updated 3 months ago
- Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.☆350Updated this week
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆795Updated this week
- New Generation Opensource Data Stack Demo☆410Updated last year
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io☆1,913Updated last week
- dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipe…☆393Updated this week