ashkangoleh / pyiceberg-lakehouseLinks
☆14Updated 4 months ago
Alternatives and similar repositories for pyiceberg-lakehouse
Users that are interested in pyiceberg-lakehouse are comparing it to the libraries listed below
Sorting:
- Demonstrating the capabilities of DuckDB as a transformation engine for data lakes☆30Updated last year
- This repo demonstrates an Apache Arrow Flight server implementation in Kubernetes.☆11Updated 11 months ago
- API Framework heavily relying on the power of DuckDB and DuckDB extensions. Ready to build performant and cost-efficient APIs on top of B…☆44Updated 3 months ago
- Rust based DuckDB Server with 1st class Postgres interface for BI Tools like Power BI and Tableau. Streaming interfaces like Web, Kafka, …☆49Updated last week
- Iceberg Playground in a Box☆67Updated 3 months ago
- ☆11Updated 10 months ago
- Sample code to collect Apache Iceberg metrics for table monitoring☆28Updated last year
- A high-performance data streaming system using DuckDB and Apache Arrow Flight.☆89Updated 7 months ago
- Enables Python developers to leverage Debezium's CDC capabilities with custom event handlers and seamless integration.☆35Updated last month
- Arrow Flight SQL Server☆111Updated 3 months ago
- ☆54Updated last week
- 🚀 GizmoSQL — High-Performance SQL Server☆190Updated this week
- DuckDB API Server with Arrow Flight SQL Airport support and concurrent writes/reads (quackpipe)☆104Updated 7 months ago
- Python wrapper for the Sling CLI tool☆57Updated last week
- ☆59Updated 5 months ago
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily☆102Updated this week
- Python package for querying iceberg data through duckdb.☆70Updated last year
- Unity Catalog UI☆43Updated last year
- Delta reader for the Ray open-source toolkit for building ML applications☆46Updated last year
- Example project demonstrating deployment patterns for real-time streaming workflows with Prefect 2.0☆45Updated 3 years ago
- ☆16Updated 10 months ago
- ☆30Updated 10 months ago
- dlt-dagster-demo☆12Updated last year
- Sord Data Fabric: A Vue 3 frontend with a Python WebSocket server, leveraging a distributed architecture with DeltaLake and DuckDB worker…☆18Updated last year
- A Rust based data/CSV/Parquet file generator☆58Updated 7 months ago
- A write-audit-publish implementation on a data lake without the JVM☆46Updated last year
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated 2 years ago
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data type…☆59Updated this week
- ☆39Updated 5 months ago
- Yet Another (Spark) ETL Framework☆21Updated last year