eakmanrq / sqlframe
Turning PySpark Into a Universal DataFrame API
☆349Updated this week
Alternatives and similar repositories for sqlframe:
Users that are interested in sqlframe are comparing it to the libraries listed below
- The smallest DuckDB SQL orchestrator on Earth.☆193Updated 4 months ago
- Dagster Labs' open-source data platform, built with Dagster.☆301Updated this week
- Schema modelling framework for decentralised domain-driven ownership of data.☆249Updated last year
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset☆206Updated last month
- Apache PyIceberg☆551Updated this week
- DuckDB extension for Delta Lake☆152Updated this week
- 🏃♀️ Minimalist alternative to dbt☆228Updated this week
- Possibly the fastest DataFrame-agnostic quality check library in town.☆180Updated this week
- A lightweight Python-based tool for extracting and analyzing data column lineage for dbt projects☆125Updated last month
- dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)☆968Updated last week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆315Updated last year
- A Postgres Proxy Server in Python☆265Updated last month
- ☆191Updated last week
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples☆100Updated this week
- PyAirbyte brings the power of Airbyte to every Python developer.☆241Updated this week
- Delta Lake helper methods in PySpark☆312Updated 4 months ago
- A dbt-core plugin to weave together multi-project dbt-core deployments☆134Updated last week
- Get started with dbt in less than 1 minute from `git clone` to `dbt docs serve` for free!☆165Updated last month
- Work with your web service, database, and streaming schemas in a single format.☆337Updated 9 months ago
- Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.☆360Updated this week
- dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipe…☆401Updated this week
- A dbt package for modelling dbt metadata. https://brooklyn-data.github.io/dbt_artifacts☆341Updated this week
- A playground for running duckdb as a stateless query engine over a data lake☆184Updated last year
- A Python Library to support running data quality rules while the spark job is running⚡☆167Updated last week
- A web API for dbt.☆110Updated last month
- CLI tool for dbt users to simplify creation of staging models (yml and sql) files☆255Updated 2 weeks ago
- This package contains macros and models to find DAG issues automatically☆460Updated last month
- Fake Snowflake Connector for Python. Run, mock and test Snowflake DB locally.☆111Updated 2 weeks ago
- Date-related macros for dbt☆239Updated last month