canimus / cuallee
Possibly the fastest DataFrame-agnostic quality check library in town.
☆181 Updated this week
Alternatives and similar repositories for cuallee:
Users who are interested in cuallee are comparing it to the libraries listed below:
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow ☆199 Updated this week
- Turning PySpark Into a Universal DataFrame API ☆365 Updated this week
- A Python package that creates fine-grained dbt tasks on Apache Airflow ☆64 Updated 4 months ago
- A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions. ☆115 Updated 3 weeks ago
- 🏃♀️ Minimalist alternative to dbt ☆236 Updated this week
- A write-audit-publish implementation on a data lake without the JVM ☆46 Updated 6 months ago
- The smallest DuckDB SQL orchestrator on Earth. ☆271 Updated 2 weeks ago
- A lightweight Python-based tool for extracting and analyzing data column lineage for dbt projects ☆134 Updated 2 months ago
- A dbt-core plugin to weave together multi-project dbt-core deployments ☆137 Updated 2 weeks ago
- All things awesome related to Dagster! ☆95 Updated this week
- end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidence ☆182 Updated this week
- Dagster Labs' open-source data platform, built with Dagster. ☆317 Updated this week
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples ☆110 Updated last week
- Snowflake-specific utility macros for dbt projects. ☆108 Updated 7 months ago
- A simple and easy to use Data Quality (DQ) tool built with Python. ☆49 Updated last year
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset ☆213 Updated 3 weeks ago
- Contribute to dlt verified sources 🔥 ☆80 Updated 2 weeks ago
- A Python Library to support running data quality rules while the spark job is running⚡ ☆171 Updated 3 weeks ago
- Get started with dbt in less than 1 minute from `git clone` to `dbt docs serve` for free! ☆171 Updated last week
- CLI tool for dbt users to simplify creation of staging models (yml and sql) files ☆255 Updated this week
- Schema modelling framework for decentralised domain-driven ownership of data. ☆250 Updated last year
- This package generates database constraints based on the tests in a dbt project ☆147 Updated 2 months ago
- Code snippets for Data Engineering Design Patterns book ☆68 Updated last week
- Step-by-step tutorial on building a Kimball dimensional model with dbt ☆123 Updated 6 months ago
- A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros. ☆184 Updated last year
- Slow & local data allows you to move fast and deliver business value for the 99.9% of the data challenges. ☆134 Updated last week
- The Modern Data Stack in a Python package ☆49 Updated last year
- Linter for dbt metadata ☆125 Updated 2 weeks ago
- ☆73 Updated 4 months ago
- Delta Lake helper methods in PySpark ☆315 Updated 5 months ago