eakmanrq / sqlframe
Turning PySpark Into a Universal DataFrame API
☆366 Updated this week
Alternatives and similar repositories for sqlframe:
Users who are interested in sqlframe are comparing it to the libraries listed below.
- The smallest DuckDB SQL orchestrator on Earth. ☆279 Updated 2 weeks ago
- A lightweight Python-based tool for extracting and analyzing data column lineage for dbt projects ☆136 Updated 2 months ago
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg ☆318 Updated last year
- CLI tool for dbt users to simplify creation of staging models (yml and sql) files ☆255 Updated last week
- 🏃‍♀️ Minimalist alternative to dbt ☆236 Updated this week
- A dbt-core plugin to weave together multi-project dbt-core deployments ☆137 Updated 3 weeks ago
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples ☆111 Updated 2 weeks ago
- Schema modelling framework for decentralised domain-driven ownership of data. ☆250 Updated last year
- ☆215 Updated this week
- DuckDB extension for Delta Lake ☆163 Updated last week
- dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipe… ☆410 Updated this week
- Possibly the fastest DataFrame-agnostic quality check library in town. ☆181 Updated last week
- A Postgres Proxy Server in Python ☆269 Updated 2 months ago
- Apache PyIceberg ☆596 Updated this week
- Work with your web service, database, and streaming schemas in a single format. ☆337 Updated 10 months ago
- PyAirbyte brings the power of Airbyte to every Python developer. ☆249 Updated this week
- A dbt package for modelling dbt metadata. https://brooklyn-data.github.io/dbt_artifacts ☆350 Updated 3 weeks ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow ☆199 Updated last week
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset ☆213 Updated this week
- Date-related macros for dbt ☆243 Updated 2 months ago
- The bridge to effortless multi-engine data applications, currently supports Snowflake ❄️ and DuckDB 🦆 ☆157 Updated this week
- Delta Lake helper methods in PySpark ☆315 Updated 5 months ago
- A web API for dbt. ☆109 Updated 2 months ago
- Dagster Labs' open-source data platform, built with Dagster. ☆318 Updated this week
- This package contains macros and models to find DAG issues automatically ☆465 Updated 3 weeks ago
- Useful macros when performing data audits ☆345 Updated 3 weeks ago
- dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org) ☆992 Updated this week
- List of `pre-commit` hooks to ensure the quality of your `dbt` projects. ☆625 Updated 2 weeks ago
- Pythonic Iceberg REST Catalog ☆72 Updated 5 months ago
- A Python Library to support running data quality rules while the spark job is running ⚡ ☆172 Updated this week