eakmanrq / sqlframe
Turning PySpark Into a Universal DataFrame API
☆323Updated this week
Related projects ⓘ
Alternatives and complementary repositories for sqlframe
- The smallest DuckDB SQL orchestrator on Earth.☆177Updated 2 months ago
- Apache PyIceberg☆473Updated this week
- dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)☆923Updated this week
- Dagster Labs' open-source data platform, built with Dagster.☆284Updated this week
- Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.☆350Updated this week
- 🏃♀️ Minimalist alternative to dbt☆215Updated this week
- Possibly the fastest DataFrame-agnostic quality check library in town.☆174Updated this week
- Work with your web service, database, and streaming schemas in a single format.☆332Updated 7 months ago
- Schema modelling framework for decentralised domain-driven ownership of data.☆247Updated 11 months ago
- A lightweight Python-based tool for extracting and analyzing data column lineage for dbt projects☆113Updated last week
- A Postgres Proxy Server in Python☆253Updated last month
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB, PostgreSQL and Superset☆183Updated last week
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆189Updated last week
- A dbt-core plugin to weave together multi-project dbt-core deployments☆122Updated this week
- CLI tool for dbt users to simplify creation of staging models (yml and sql) files☆249Updated 3 weeks ago
- Get started with dbt in less than 1 minute from `git clone` to `dbt docs serve` for free!☆152Updated 3 months ago
- PyAirbyte brings the power of Airbyte to every Python developer.☆232Updated this week
- A Python Library to support running data quality rules while the spark job is running⚡☆163Updated last week
- Data product portal created by Dataminded☆146Updated this week
- DuckDB extension for Delta Lake☆136Updated last week
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆195Updated this week
- Pythonic Iceberg REST Catalog☆67Updated 2 months ago
- Delta Lake helper methods in PySpark☆304Updated 2 months ago
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆306Updated last year
- end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidence☆148Updated last week
- dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipe…☆393Updated this week
- A web API for dbt.☆110Updated 9 months ago
- Open Control Plane for Tables in Data Lakehouse☆312Updated this week
- All things awesome related to Dagster!☆81Updated 3 weeks ago