eakmanrq / sqlframeLinks
Turning PySpark Into a Universal DataFrame API
☆403Updated this week
Alternatives and similar repositories for sqlframe
Users that are interested in sqlframe are comparing it to the libraries listed below
Sorting:
- The smallest DuckDB SQL orchestrator on Earth.☆307Updated 3 weeks ago
- ☆278Updated last week
- A lightweight Python-based tool for extracting and analyzing data column lineage for dbt projects☆164Updated 2 months ago
- Possibly the fastest DataFrame-agnostic quality check library in town.☆190Updated last week
- dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)☆1,086Updated this week
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆216Updated 3 weeks ago
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples☆120Updated 3 months ago
- 🏃♀️ Minimalist SQL orchestrator☆244Updated this week
- Apache PyIceberg☆750Updated this week
- A Postgres Proxy Server in Python☆283Updated 5 months ago
- DuckDB extension for Delta Lake☆188Updated last week
- Pushdown compute from Snowflake to DuckDB running on your infrastructure☆183Updated last week
- A dbt-core plugin to weave together multi-project dbt-core deployments☆148Updated last week
- deferred, multi-engine computational framework☆277Updated this week
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset☆230Updated 3 months ago
- ☆82Updated last week
- Dagster Labs' open-source data platform, built with Dagster.☆360Updated this week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆330Updated 2 years ago
- Schema modelling framework for decentralised domain-driven ownership of data.☆254Updated last year
- Delta Lake helper methods in PySpark☆326Updated 8 months ago
- CLI tool for dbt users to simplify creation of staging models (yml and sql) files☆263Updated last week
- Database connectivity API standard and libraries for Apache Arrow☆450Updated this week
- Work with your web service, database, and streaming schemas in a single format.☆343Updated this week
- Proof-of-concept extension combining the delta extension with Unity Catalog☆84Updated this week
- A playground for running duckdb as a stateless query engine over a data lake☆201Updated last year
- Run, mock and test fake Snowflake databases locally.☆138Updated last week
- The Airport extension for DuckDB, enables the use of Arrow Flight with DuckDB☆236Updated this week
- A Python Library to support running data quality rules while the spark job is running⚡☆188Updated this week
- Open Control Plane for Tables in Data Lakehouse☆352Updated this week
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.☆250Updated 8 months ago