☆65May 9, 2025Updated 9 months ago
Alternatives and similar repositories for duckhouse
Users that are interested in duckhouse are comparing it to the libraries listed below
Sorting:
- ☆185May 21, 2025Updated 9 months ago
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver.☆25Sep 8, 2023Updated 2 years ago
- A compute manifest and composable tools for data, built on Ibis, DataFusion, and Arrow Flight.☆487Updated this week
- A Table format agnostic data sharing framework☆42Feb 4, 2024Updated 2 years ago
- ☆16Nov 27, 2025Updated 3 months ago
- ☆11Nov 26, 2024Updated last year
- parquet dedupe estimator☆25Feb 20, 2026Updated last week
- This repo demonstrates an Apache Arrow Flight server implementation in Kubernetes.☆12Oct 25, 2024Updated last year
- Latte is a modern data engineering toolkit.☆13Mar 4, 2024Updated last year
- Arrow-Powered Data Exchange☆15Feb 7, 2025Updated last year
- duckdb-etl-framework☆15Dec 20, 2024Updated last year
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆83Apr 12, 2025Updated 10 months ago
- A high-performance data streaming system using DuckDB and Apache Arrow Flight.☆96Feb 22, 2025Updated last year
- DuckDB Cron Expression Extension☆28Jun 23, 2024Updated last year
- A conda-smithy repository for python-duckdb.☆13Jan 29, 2026Updated last month
- This project automates setup of Cost and Usage Reports (CUR) in a billing account with an Athena table enabling querying of the latest da…☆12Updated this week
- Arrow Flight SQL Server☆129Jun 21, 2025Updated 8 months ago
- ☆18Jun 5, 2023Updated 2 years ago
- Package hub for dbt.☆31Updated this week
- dbc is the command-line tool for installing and managing ADBC drivers☆92Updated this week
- An in-process Parquet merge engine for better data warehousing in S3 with MVCC☆151Feb 15, 2026Updated 2 weeks ago
- DuckDB Pyroscope Extension for Continuous Profiling☆21Feb 18, 2026Updated last week
- Documentation and resources for deploying JupyterHub on Hadoop☆19Jul 16, 2019Updated 6 years ago
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.☆280Sep 25, 2024Updated last year
- 🚀 GizmoSQL — High-Performance SQL Server☆290Feb 24, 2026Updated last week
- DuckDB CronJob Extension☆47Feb 18, 2026Updated last week
- Extension for DuckDB for functions that require the Apache Arrow dependency☆45May 12, 2025Updated 9 months ago
- A nats micro service interacting with Ollama☆18Jun 30, 2024Updated last year
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆124Mar 31, 2025Updated 11 months ago
- Parquet Command-line Tools☆19Oct 26, 2016Updated 9 years ago
- Open Data Stack Platform: a collection of projects and pipelines built with open data stack tools for scalable, observable data platform…☆22Dec 21, 2025Updated 2 months ago
- A write-audit-publish implementation on a data lake without the JVM☆45Aug 12, 2024Updated last year
- Demonstration of a Hive Input Format for Iceberg☆26Mar 12, 2021Updated 4 years ago
- ☆22Jul 18, 2024Updated last year
- rust-for-data☆50Jul 12, 2023Updated 2 years ago
- DuckDB for streaming data☆748Sep 4, 2025Updated 5 months ago
- The open-source Useful SDK. One python decorator in the Useful library allows for full observability of Python functions within an ETL.☆19Jan 11, 2024Updated 2 years ago
- ☆21Jul 23, 2025Updated 7 months ago
- Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream …☆22Feb 6, 2017Updated 9 years ago