A comprehensive collection of data quality resources, tools, papers, and projects across various data types including traditional data, LLM pretraining/fine-tuning data, multimodal data, and more. Essential reference for researchers and practitioners in data-centric AI.
☆26 · Aug 29, 2025 · Updated 6 months ago
Alternatives and similar repositories for awesome-data-quality
Users who are interested in awesome-data-quality are comparing it to the libraries listed below.
- A general-purpose API load testing platform that supports LLM services and business HTTP interfaces, enabling one-click performance testi… ☆182 · Updated this week
- Dingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool ☆666 · Updated this week
- Curated list of tools and frameworks assisting in monitoring data quality ☆15 · Apr 3, 2022 · Updated 3 years ago
- Data table powered by silex and vue2 ☆11 · May 17, 2017 · Updated 8 years ago
- Cross-Platform Annotation Tool for Person Search Datasets ☆11 · Aug 29, 2017 · Updated 8 years ago
- Source codes for paper "Harnessing Machine Learning to Enhance Transition State Search with Interatomic Potentials and Generative Models" ☆18 · Oct 23, 2025 · Updated 5 months ago
- Partial least squares regression ☆10 · May 13, 2025 · Updated 10 months ago
- [ICCV 2025] A Benchmark for Multi-Step Reasoning in Long Narrative Videos ☆25 · Aug 8, 2025 · Updated 7 months ago
- Baseline, check and correct your SQL Database Security ☆12 · Mar 9, 2022 · Updated 4 years ago
- Badgers: Bad Data Generators ☆14 · Jan 29, 2026 · Updated last month
- Autonomous web browser agent that audits performance, functionality & UX for engineers and vibe-coding creators. Autonomous website evaluation and testing agent: one click covers performance, functionality… ☆179 · Mar 9, 2026 · Updated 2 weeks ago
- An easy-to-use react chat plugin ☆10 · Jan 5, 2023 · Updated 3 years ago
- ZBar wrapper for Python 3 ☆10 · Apr 30, 2015 · Updated 10 years ago
- A web interface for Torque Resource Manager ☆19 · Jan 9, 2014 · Updated 12 years ago
- A fast Ramer-Douglas-Peucker algorithm implementation. ☆15 · Sep 3, 2023 · Updated 2 years ago
- Send customized alerts for your dbt project with simple tags ☆10 · Jul 27, 2021 · Updated 4 years ago
- High-level Rust library that binds to Poppler to extract text from a PDF ☆11 · Dec 16, 2020 · Updated 5 years ago
- A knowledge graph system with graph neural network for drug repurposing and disease mechanism. ☆18 · Sep 12, 2025 · Updated 6 months ago
- Code for the DiscoTope-3.0 paper and model ☆14 · Mar 19, 2026 · Updated last week
- Viewer for .avro files ☆12 · Dec 8, 2022 · Updated 3 years ago
- Elucidate and visualise a compound's mechanism of action by combining structure-based target prediction with gene expression-based causal… ☆13 · Feb 11, 2023 · Updated 3 years ago
- This demo will help you get started with AWS IoT Secure Tunneling, that helps customers establish bidirectional communication to remote d… ☆20 · Dec 8, 2024 · Updated last year
- Automatically scrape news using Google Gemini API, generate articles, and upload them to Meta Threads ☆14 · Aug 24, 2024 · Updated last year
- A C# API wrapper for the Threads API. ☆14 · Jan 17, 2025 · Updated last year
- Grad-CAM for weakly object detection ☆12 · Dec 19, 2018 · Updated 7 years ago
- Prevent downstream data quality issues by integrating the Soda Library into your CI/CD pipeline. ☆17 · Jan 29, 2026 · Updated last month
- Threads-Projects: Unleashing the power of Meta's Threads.net platform with insightful bots and efficient workflows ☆14 · Jan 10, 2024 · Updated 2 years ago
- Alignment, a collaborative, system aided, user driven ontology/vocabulary matching and validation platform. ☆13 · Mar 29, 2022 · Updated 3 years ago
- ☆17 · Mar 13, 2023 · Updated 3 years ago
- Flutter ListView ☆11 · Jan 9, 2023 · Updated 3 years ago
- MCP that provides controlled and secure SQL Server database access for LLM applications. ☆25 · Nov 13, 2025 · Updated 4 months ago
- Takes your Threads posts URL and converts it to an image (threadimage) ☆10 · Jan 28, 2024 · Updated 2 years ago
- ☆10 · Apr 22, 2021 · Updated 4 years ago
- Full spreadsheet-style pivot table through SQL macros. Just specify values, rows, columns, and filters! ☆19 · Feb 25, 2026 · Updated last month
- Unification of Directed Acyclic Graphs in Clojure ☆21 · Oct 26, 2025 · Updated 5 months ago
- Python code to programmatically access iTunes Connect ☆12 · Mar 9, 2016 · Updated 10 years ago
- Homework for STAT 205A - Berkeley ☆13 · Dec 9, 2014 · Updated 11 years ago
- Azure data studio extension to search tables, stored procedures, functions by name or code. ☆15 · Feb 13, 2024 · Updated 2 years ago
- Example code and data samples for "An experimentally validated approach to automated biological evidence generation in drug discovery usi… ☆12 · Jan 25, 2024 · Updated 2 years ago