Awesome material(papers, tools, etc.) about testing machine learning system, including deep learning system.
☆47Oct 12, 2021Updated 4 years ago
Alternatives and similar repositories for awesome-ml-testing
Users that are interested in awesome-ml-testing are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- EvalDNN: A Toolbox for Evaluating Deep Neural Network Models☆14Mar 9, 2020Updated 6 years ago
- Taxonomy of Real Faults in Deep Learning Systems☆15Jan 27, 2020Updated 6 years ago
- A Clone-Based Approach for Recommending Modification on Pasted Code☆12Jun 10, 2017Updated 8 years ago
- Pattern Fuzzing for Worst-Case Algorithmic Complexity using Program Synthesis☆20Aug 24, 2021Updated 4 years ago
- ☆12Jul 21, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- LLM evaluation.☆16Nov 7, 2023Updated 2 years ago
- Structure-Invariant Testing for Machine Translation [ICSE'20]☆16Dec 17, 2020Updated 5 years ago
- Collect simple coverage information in memory.☆11Oct 6, 2022Updated 3 years ago
- Code release of a paper "Guiding Deep Learning System Testing using Surprise Adequacy"☆49May 26, 2022Updated 4 years ago
- ☆19Jun 25, 2025Updated 11 months ago
- A project for computing differences of multiple clone instances.☆17Nov 4, 2019Updated 6 years ago
- Indexing reachability for context-sensitive data flow analysis.☆12Jul 10, 2022Updated 3 years ago
- A tool for learning bug patterns.☆11Jul 19, 2016Updated 9 years ago
- Java Ranger is a path-merging extension of Symbolic PathFinder☆16Mar 16, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A benchmark suite (under construction) for smart contract vulnerability tools☆17Jul 13, 2021Updated 4 years ago
- A fuzzer for SMT solvers☆21May 8, 2026Updated last month
- To generate decoy traffic against WF attack using GAN☆13Jul 17, 2025Updated 10 months ago
- This is the implement repository of our upcoming ESEC/FSE 2020 paper: Deep Learning Library Testing via Effective Model Generation.☆56Oct 29, 2023Updated 2 years ago
- A Reduction Tool for SQL Bachelor's thesis of Jonas Müntener☆16Oct 15, 2024Updated last year
- TVMFuzz: fuzzing tensor-level intermediate representation in TVM☆32May 24, 2020Updated 6 years ago
- Automated DNN generation for fuzz testing and more☆147Jan 14, 2025Updated last year
- 北大树洞爬虫☆11Jul 30, 2020Updated 5 years ago
- Checks the PDFs submitted to a conference, e.g., for formatting violations and double anonymous violations☆68Dec 11, 2021Updated 4 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- GenCoG: A DSL-Based Approach to Generating Computation Graphs for TVM Testing (ISSTA‘23)☆17Jul 19, 2023Updated 2 years ago
- Concolic Testing for Deep Neural Networks☆118Jul 16, 2021Updated 4 years ago
- ☆23Mar 20, 2021Updated 5 years ago
- A list of bugs found by SQLancer☆17Jan 30, 2024Updated 2 years ago
- ☆15Jan 23, 2020Updated 6 years ago
- A model checker and assume/guarantee contract generator for Lustre programs.☆15Jun 5, 2018Updated 8 years ago
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts☆35Jul 2, 2024Updated last year
- Characterizing Transaction-Reverting Statements in Ethereum Smart Contracts.☆11Sep 1, 2021Updated 4 years ago
- Analyze execution trace to find regression bug☆41Jun 2, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Fast and Precise On-the-fly Patch Validation for All☆10Feb 24, 2023Updated 3 years ago
- WhiteFox: White-Box Compiler Fuzzing Empowered by Large Language Models (OOPSLA 2024)☆83Aug 5, 2025Updated 10 months ago
- Static program analysis framework for Ethereum smart contract bytecode.☆168Apr 13, 2026Updated last month
- Robustness benchmark for DNN models.☆66Aug 8, 2022Updated 3 years ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆43Feb 15, 2024Updated 2 years ago
- A Static Differential Analysis Tool of Network Protocol Parsers☆29Feb 21, 2024Updated 2 years ago
- An experimental framework for temporal verification based on first-order linear-time temporal logic. Our goal is to express transition sy…☆23Mar 29, 2026Updated 2 months ago