kwanUm / awesome-data-quality
Curated list of tools and frameworks assisting in monitoring data quality
☆12Updated 2 years ago
Alternatives and similar repositories for awesome-data-quality:
Users that are interested in awesome-data-quality are comparing it to the libraries listed below
- DuckDB Community Extension to prompt LLMs from SQL☆35Updated last month
- Build super simple end-to-end data & ETL pipelines for your vector databases and Generative AI applications☆87Updated 4 months ago
- Sample configuration to deploy a modern data platform.☆87Updated 3 years ago
- Data Tools Subjective List☆83Updated last year
- The bridge to effortless multi-engine data applications, currently supports Snowflake ❄️ and DuckDB 🦆☆157Updated this week
- Transform your pythonic research to an artifact that engineers can deploy easily.☆150Updated 9 months ago
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner☆71Updated 3 years ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆209Updated last week
- A curated list of awesome blogs, videos, tools and resources about Data Contracts☆171Updated 6 months ago
- 🚀 Stream inferences of real-time ML models in production to any data lake (Experimental)☆79Updated 2 years ago
- HyPSTER - HyperParameter optimization on STERoids☆46Updated 2 months ago
- ☆93Updated last year
- Airbyte made simple (no UI, no database, no cluster)☆163Updated 3 months ago
- Anomstack - Painless open source anomaly detection for your metrics 📈📉🚀☆96Updated this week
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆53Updated 5 months ago
- A playground for running duckdb as a stateless query engine over a data lake☆184Updated last year
- High-scale LLM gateway, written in Rust. OpenTelemetry-based observability included☆50Updated 2 months ago
- Python wrapper for the Sling CLI tool☆45Updated this week
- A curated list of awesome DataOps tools☆174Updated 4 months ago
- Contribute to dlt verified sources 🔥☆80Updated 3 weeks ago
- A dbt package for doing product analytics☆85Updated 2 years ago
- A curated list of awesome open source tools and commercial products for monitoring data quality, monitoring model performance, and profil…☆72Updated 9 months ago
- Quickstart for any service☆138Updated this week
- ☆22Updated 8 months ago
- Metrics Observability & Troubleshooting☆318Updated 11 months ago
- Low-code Python library to safely use notebooks in production: schedule workflows, generate assets, trigger webhooks, send notifications,…☆285Updated last week
- This repo demonstrate a comprehensive modern data stack using popular open-source tools.☆28Updated last year
- Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba…☆225Updated last week
- Swiple enables you to easily observe, understand, validate and improve the quality of your data☆82Updated this week
- ⚡ valmi.io reverse ETL (data activation) is the open source ( OSS ) data activation platform to load data from warehouses into Webhooks a…☆149Updated 7 months ago