eakmanrq / sqlframeLinks
Turning PySpark Into a Universal DataFrame API
☆417Updated this week
Alternatives and similar repositories for sqlframe
Users that are interested in sqlframe are comparing it to the libraries listed below
Sorting:
- The smallest DuckDB SQL orchestrator on Earth.☆315Updated 2 months ago
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples☆122Updated 5 months ago
- ☆149Updated 2 months ago
- 🏃♀️ Minimalist SQL orchestrator☆257Updated this week
- A lightweight Python-based tool for extracting and analyzing data column lineage for dbt projects☆176Updated 4 months ago
- ☆298Updated this week
- Possibly the fastest DataFrame-agnostic quality check library in town.☆198Updated last week
- A Postgres Proxy Server in Python☆295Updated 7 months ago
- Catalog, compose, and ship ML—Python simplicity, SQL scale.☆365Updated this week
- Dagster Labs' open-source data platform, built with Dagster.☆382Updated this week
- Proof-of-concept extension combining the delta extension with Unity Catalog☆89Updated 3 weeks ago
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆115Updated 4 months ago
- Schema modelling framework for decentralised domain-driven ownership of data.☆254Updated last year
- Run, mock and test fake Snowflake databases locally.☆145Updated this week
- A dbt-core plugin to weave together multi-project dbt-core deployments☆164Updated 3 weeks ago
- ☆136Updated last week
- DuckDB extension for Delta Lake☆194Updated last week
- Alto is a versatile data integration tool that allows you to easily run Singer plugins, build and cache PEX files encapsulating those plu…☆61Updated 2 years ago
- Apache PyIceberg☆819Updated this week
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆218Updated this week
- Linter for dbt metadata☆169Updated this week
- A playground for running duckdb as a stateless query engine over a data lake☆210Updated last year
- The Airport extension for DuckDB, enables the use of Arrow Flight with DuckDB☆276Updated this week
- Get started with dbt in less than 1 minute from `git clone` to `dbt docs serve` for free!☆209Updated 2 months ago
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset☆235Updated 5 months ago
- Pythonic Iceberg REST Catalog☆3Updated last month
- Pushdown compute from Snowflake to DuckDB running on your infrastructure☆187Updated 3 weeks ago
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆330Updated 2 years ago
- Work with your web service, database, and streaming schemas in a single format.☆345Updated last month
- PyAirbyte brings the power of Airbyte to every Python developer.☆281Updated this week