apache / iceberg-pythonLinks
PyIceberg
☆974Updated this week
Alternatives and similar repositories for iceberg-python
Users that are interested in iceberg-python are comparing it to the libraries listed below
Sorting:
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,395Updated this week
- Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.☆1,137Updated this week
- Apache Polaris, the interoperable, open source catalog for Apache Iceberg☆1,790Updated this week
- Turning PySpark Into a Universal DataFrame API☆471Updated this week
- dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)☆1,211Updated this week
- Apache DataFusion Comet Spark Accelerator☆1,096Updated this week
- ☆365Updated this week
- Python client for Trino☆408Updated 4 months ago
- Database connectivity API standard and libraries for Apache Arrow☆530Updated this week
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)☆253Updated 3 weeks ago
- The next-generation engine for dbt☆601Updated this week
- This repository has moved into https://github.com/dbt-labs/dbt-adapters☆444Updated 5 months ago
- This repository has moved into https://github.com/dbt-labs/dbt-adapters☆250Updated 11 months ago
- A highly efficient daemon for streaming data from Kafka into Delta Lake☆424Updated 8 months ago
- An open protocol for secure data sharing☆914Updated this week
- Scalable and efficient data transformation framework - backwards compatible with dbt.☆2,837Updated this week
- Run your dbt Core or dbt Fusion projects as Apache Airflow DAGs and Task Groups with a few lines of code☆1,118Updated this week
- Apache DataFusion Python Bindings☆549Updated this week
- Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.☆377Updated 7 months ago
- An Open Standard for lineage metadata collection☆2,246Updated this week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,450Updated this week
- PyAirbyte brings the power of Airbyte to every Python developer.☆315Updated this week
- Drop-in replacement for Apache Spark UI☆382Updated last month
- Sling is a CLI tool that extracts data from a source storage/database and loads it in a target storage/database.☆753Updated this week
- Open Control Plane for Tables in Data Lakehouse☆377Updated this week
- Dagster Labs' open-source data platform, built with Dagster.☆428Updated this week
- PySpark test helper methods with beautiful error messages☆746Updated this week
- Delta Lake helper methods in PySpark☆326Updated last year
- Home of the Open Data Contract Standard (ODCS).☆632Updated last week
- CLI that makes it easy to create, test and deploy Airflow DAGs to Astronomer☆429Updated this week