kwanUm / awesome-data-quality
Curated list of tools and frameworks assisting in monitoring data quality
★15 · Updated 3 years ago
Alternatives and similar repositories for awesome-data-quality
Users who are interested in awesome-data-quality are comparing it to the libraries listed below.
- Contribute to dlt verified sources ★101 · Updated 2 weeks ago
- ★365 · Updated 2 weeks ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ… ★176 · Updated last week
- Declarative context engineering for agents ★418 · Updated this week
- ★40 · Updated 8 months ago
- Pushdown compute from Snowflake to DuckDB running on your infrastructure ★201 · Updated 2 months ago
- A framework to manage data, continuously ★32 · Updated 11 months ago
- Cost Efficient Data Pipelines with DuckDB ★60 · Updated 7 months ago
- Data Product Portal created by Dataminded ★197 · Updated this week
- A portable Datamart and Business Intelligence suite built with Docker, Mage, dbt, DuckDB and Superset ★53 · Updated 2 weeks ago
- Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team … ★129 · Updated last month
- Alto is a versatile data integration tool that allows you to easily run Singer plugins, build and cache PEX files encapsulating those plu… ★59 · Updated 2 years ago
- Possibly the fastest DataFrame-agnostic quality check library in town. ★233 · Updated 2 months ago
- A playground for running duckdb as a stateless query engine over a data lake ★217 · Updated last year
- Demo Project for Open Source MDS ★168 · Updated 4 months ago
- Data management with LLMs ★179 · Updated 11 months ago
- Next generation compute platform for the post-modern data stack ★20 · Updated this week
- Minimalist SQL orchestrator ★296 · Updated this week
- ★81 · Updated 10 months ago
- A simple and easy to use Data Quality (DQ) tool built with Python. ★50 · Updated 2 years ago
- ★167 · Updated 7 months ago
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples ★123 · Updated 10 months ago
- Python package for querying iceberg data through duckdb. ★70 · Updated last year
- A write-audit-publish implementation on a data lake without the JVM ★45 · Updated last year
- An end-to-end LLM reference implementation providing a Q&A interface for Airflow and Astronomer ★276 · Updated 5 months ago
- Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines. ★177 · Updated this week
- DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data qualit… ★68 · Updated last week
- Datapilot-cli is the command line interface for accessing the AI teammate for engineers to ensure best practices in their SQL and dbt proj… ★35 · Updated this week
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects ★225 · Updated 7 months ago
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset ★257 · Updated 2 weeks ago