capitalone / datacompy
Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!
☆484Updated this week
Related projects ⓘ
Alternatives and complementary repositories for datacompy
- Turning PySpark Into a Universal DataFrame API☆323Updated this week
- Snowflake Snowpark Python API☆272Updated this week
- PySpark test helper methods with beautiful error messages☆621Updated 3 weeks ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆166Updated last year
- Dagster Labs' open-source data platform, built with Dagster.☆284Updated this week
- Distributed SQL Engine in Python using Dask☆397Updated 2 months ago
- Making DAG construction easier☆244Updated last week
- Apache Airflow integration for dbt☆396Updated 6 months ago
- Possibly the fastest DataFrame-agnostic quality check library in town.☆174Updated this week
- Python API for Deequ☆730Updated last month
- Great Expectations Airflow operator☆159Updated 3 weeks ago
- dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)☆923Updated this week
- pyspark methods to enhance developer productivity 📣 👯 🎉☆643Updated last month
- Delta Lake helper methods in PySpark☆304Updated 2 months ago
- Apache PyIceberg☆473Updated this week
- dbt adapter for SQL Server and Azure SQL☆216Updated 3 weeks ago
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆623Updated last week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,013Updated last month
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆111Updated 7 months ago
- SQLAlchemy driver for DuckDB☆355Updated this week
- Snowflake Connector for Python☆599Updated this week
- A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros.☆180Updated last year
- A dbt package for modelling dbt metadata. https://brooklyn-data.github.io/dbt_artifacts☆331Updated 2 weeks ago
- This dbt package contains macros to support unit testing that can be (re)used across dbt projects.☆423Updated 3 months ago
- List of `pre-commit` hooks to ensure the quality of your `dbt` projects.☆599Updated last week
- Generate and Visualize Data Lineage from query history☆311Updated last year
- Data pipeline with dbt, Airflow, Great Expectations☆158Updated 3 years ago
- Macros that generate dbt code☆492Updated last month
- Clean APIs for data cleaning. Python implementation of R package Janitor☆1,364Updated this week