Turning PySpark Into a Universal DataFrame API
☆493 Feb 28, 2026 Updated this week
Alternatives and similar repositories for sqlframe
Users who are interested in sqlframe are comparing it to the libraries listed below.
- Scalable and efficient data transformation framework - backwards compatible with dbt. ☆2,928 Updated this week
- The smallest DuckDB SQL orchestrator on Earth. ☆337 Nov 22, 2025 Updated 3 months ago
- 🏃‍♀️ Minimalist SQL orchestrator ☆307 Updated this week
- A compute manifest and composable tools for data, built on Ibis, DataFusion, and Arrow Flight. ☆487 Updated this week
- 📦 Serverless and local-first Open Data Platform ☆308 Jan 22, 2026 Updated last month
- Python SQL Parser and Transpiler ☆8,980 Updated this week
- the portable Python dataframe library ☆6,417 Updated this week
- DuckDB for streaming data ☆748 Sep 4, 2025 Updated 6 months ago
- The Airport extension for DuckDB, enables the use of Arrow Flight with DuckDB ☆327 Feb 18, 2026 Updated 2 weeks ago
- dbt adapter for DuckDB ☆1,236 Updated this week
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends. ☆280 Sep 25, 2024 Updated last year
- Pushdown compute from Snowflake to DuckDB running on your infrastructure ☆204 Oct 20, 2025 Updated 4 months ago
- ☆35 Jul 23, 2023 Updated 2 years ago
- The data-validation toolkit for enhanced dbt (data build tool) PR review ☆440 Updated this week
- Making data lake work for time series ☆1,190 Aug 21, 2024 Updated last year
- Database connectivity API standard and libraries for Apache Arrow ☆563 Updated this week
- A framework to manage data, continuously ☆33 Jan 20, 2025 Updated last year
- A curated list of awesome SQLMesh resources ☆38 Apr 30, 2025 Updated 10 months ago
- Proof-of-concept extension combining the delta extension with Unity Catalog ☆99 Updated this week
- Apache DataFusion Python Bindings ☆564 Feb 26, 2026 Updated last week
- Possibly the fastest DataFrame-agnostic quality check library in town. ☆240 Feb 5, 2026 Updated last month
- GlareDB: A light and fast SQL database for analytics ☆1,003 Nov 14, 2025 Updated 3 months ago
- DuckDB extension for Delta Lake ☆215 Feb 26, 2026 Updated last week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew… ☆2,139 Feb 21, 2026 Updated last week
- Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code. ☆2,509 Updated this week
- dbt starter code for enterprise Snowflake usage data artifacts ☆21 Sep 7, 2022 Updated 3 years ago
- Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust. ☆1,203 Updated this week
- A dbt-core plugin to weave together multi-project dbt-core deployments ☆191 Jan 24, 2026 Updated last month
- data load tool (dlt) is an open source Python library that makes data loading easy 🛠️ ☆4,980 Updated this week
- PyIceberg ☆1,009 Updated this week
- Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends ☆1,980 Updated this week
- Malloy is a modern open source language for describing data relationships and transformations. ☆2,405 Feb 26, 2026 Updated last week
- 🦆 A curated list of awesome DuckDB resources ☆2,297 Feb 18, 2026 Updated 2 weeks ago
- [DEPRECATED] A dbt adapter for Excel. ☆96 Apr 7, 2025 Updated 10 months ago
- This repo demonstrates an Apache Arrow Flight server implementation in Kubernetes. ☆12 Oct 25, 2024 Updated last year
- A native Rust library for Delta Lake, with bindings into Python ☆3,160 Updated this week
- Light-weight, browser-based ROLAP pivot tables on top of DuckDB-WASM ☆563 Jan 26, 2026 Updated last month
- DuckDB HTTP API Server and Query Interface in a Community Extension ☆273 Feb 18, 2026 Updated 2 weeks ago
- Analytical database for data-driven Web applications 🪶 ☆509 Feb 25, 2025 Updated last year