kwanUm / awesome-data-quality
Curated list of tools and frameworks assisting in monitoring data quality
☆11Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for awesome-data-quality
- Entity resolution for everyone. Minimal. No dependencies.☆10Updated 3 months ago
- Sample configuration to deploy a modern data platform.☆86Updated 2 years ago
- Airbyte made simple (no UI, no database, no cluster)☆150Updated 2 weeks ago
- HyPSTER - HyperParameter optimization on STERoids☆35Updated last week
- ☆295Updated last month
- Quickstart for any service☆131Updated this week
- A playground for running duckdb as a stateless query engine over a data lake☆171Updated 10 months ago
- A dbt package for doing product analytics☆84Updated 2 years ago
- Python wrapper for the Sling CLI tool☆43Updated last month
- Data Tools Subjective List☆80Updated last year
- The metrics layer for your data. Join us at https://metriql.com/slack☆298Updated last year
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆195Updated this week
- Write python locally, execute SQL in your data warehouse☆270Updated 2 years ago
- Examples of using Prefect for background tasks in web applications☆19Updated this week
- Instantly understand and summarize JSON structure through automatic schema inference via a Python CLI☆21Updated 2 weeks ago
- Write 70% less code by using the SDK to build custom extractors and loaders that adhere to the Singer standard: https://sdk.meltano.com☆98Updated this week
- ⚡ valmi.io reverse ETL (data activation) is the open source ( OSS ) data activation platform to load data from warehouses into Webhooks a…☆143Updated 4 months ago
- Schema modelling framework for decentralised domain-driven ownership of data.☆247Updated 11 months ago
- Graphsignal Tracer for Python☆202Updated 3 months ago
- Add and see other's reactions to your code!☆31Updated last year
- Anomstack - Painless open source anomaly detection for your metrics 📈📉🚀☆91Updated this week
- Data Catalogs Made Easy☆18Updated last month
- ☆38Updated this week
- A write-audit-publish implementation on a data lake without the JVM☆41Updated 3 months ago
- Read infrastructure data from your cloud ☁️ and export it to a SQL database 📋.☆32Updated last year
- Build super simple end-to-end data & ETL pipelines for your vector databases and Generative AI applications☆78Updated last month
- Query Snowflake tables locally with DuckDB, without any need for a running warehouse☆101Updated this week
- S3 vector database for LLM Agents and RAG.☆29Updated last year
- Straightforward implementation of some important machine learning algorithms and components.☆9Updated 5 months ago