eakmanrq / sqlframe
Turning PySpark Into a Universal DataFrame API
☆385Updated this week
Alternatives and similar repositories for sqlframe:
Users that are interested in sqlframe are comparing it to the libraries listed below
- The smallest DuckDB SQL orchestrator on Earth.☆298Updated 2 months ago
- ☆254Updated this week
- A lightweight Python-based tool for extracting and analyzing data column lineage for dbt projects☆155Updated 3 weeks ago
- A dbt-core plugin to weave together multi-project dbt-core deployments☆143Updated 2 weeks ago
- Schema modelling framework for decentralised domain-driven ownership of data.☆252Updated last year
- 🏃♀️ Minimalist SQL orchestrator☆243Updated this week
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples☆118Updated 2 months ago
- Possibly the fastest DataFrame-agnostic quality check library in town.☆186Updated last week
- dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)☆1,056Updated this week
- Get started with dbt in less than 1 minute from `git clone` to `dbt docs serve` for free!☆187Updated last week
- Dagster Labs' open-source data platform, built with Dagster.☆344Updated this week
- CLI tool for dbt users to simplify creation of staging models (yml and sql) files☆261Updated last week
- Apache PyIceberg☆687Updated this week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆327Updated 2 years ago
- Delta Lake helper methods in PySpark☆322Updated 7 months ago
- Work with your web service, database, and streaming schemas in a single format.☆343Updated 2 weeks ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆215Updated last week
- A Postgres Proxy Server in Python☆275Updated 4 months ago
- dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipe…☆431Updated this week
- The bridge to effortless multi-engine data applications, currently supports Snowflake ❄️ and DuckDB 🦆☆177Updated 2 weeks ago
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆111Updated 3 weeks ago
- Date-related macros for dbt☆243Updated 4 months ago
- DuckDB extension for Delta Lake☆176Updated 2 weeks ago
- PyAirbyte brings the power of Airbyte to every Python developer.☆261Updated last week
- Useful macros when performing data audits☆356Updated 3 months ago
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset☆225Updated 2 months ago
- Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.☆369Updated this week
- Quickstart for any service☆145Updated this week
- Proof-of-concept extension combining the delta extension with Unity Catalog☆82Updated last month
- Open Control Plane for Tables in Data Lakehouse☆341Updated last week