canimus / cuallee
Possibly the fastest DataFrame-agnostic quality check library in town.
☆171Updated this week
Related projects ⓘ
Alternatives and complementary repositories for cuallee
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆187Updated last week
- Delta Lake helper methods in PySpark☆304Updated 2 months ago
- The smallest DuckDB SQL orchestrator on Earth.☆171Updated last month
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆191Updated this week
- Turning PySpark Into a Universal DataFrame API☆317Updated this week
- A Python Library to support running data quality rules while the spark job is running⚡☆162Updated this week
- end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidence☆145Updated 2 weeks ago
- A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions.☆110Updated 3 months ago
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆61Updated last month
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever…☆221Updated last week
- All things awesome related to Dagster!☆79Updated 2 weeks ago
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.☆22Updated 7 months ago
- Fake Snowflake Connector for Python. Run, mock and test Snowflake DB locally.☆108Updated this week
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB, PostgreSQL and Superset☆177Updated this week
- A lightweight Python-based tool for extracting and analyzing data column lineage for dbt projects☆107Updated 3 weeks ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆48Updated last year
- an ephemeral project repo for the DU Dagster project☆52Updated this week
- Get started with dbt in less than 1 minute from `git clone` to `dbt docs serve` for free!☆146Updated 2 months ago
- ✨ A Pydantic to PySpark schema library☆55Updated this week
- Code for dbt tutorial☆143Updated 5 months ago
- [DEPRECATED] A dbt adapter for Excel.☆91Updated last year
- Step-by-step tutorial on building a Kimball dimensional model with dbt☆109Updated 3 months ago
- Delta Lake helper methods. No Spark dependency.☆22Updated 2 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆111Updated 7 months ago
- Delta Lake examples☆205Updated last month
- Schema modelling framework for decentralised domain-driven ownership of data.☆247Updated 11 months ago
- Pythonic Iceberg REST Catalog☆65Updated last month
- Great Expectations Airflow operator☆159Updated last week