Awesome material (papers, tools, etc.) about testing machine learning systems, including deep learning systems.
☆47 · Oct 12, 2021 · Updated 4 years ago
Alternatives and similar repositories for awesome-ml-testing
Users who are interested in awesome-ml-testing are comparing it to the libraries listed below.
- EvalDNN: A Toolbox for Evaluating Deep Neural Network Models · ☆14 · Mar 9, 2020 · Updated 6 years ago
- Taxonomy of Real Faults in Deep Learning Systems · ☆15 · Jan 27, 2020 · Updated 6 years ago
- ☆19 · Dec 8, 2022 · Updated 3 years ago
- A program analysis, verification, and optimization framework · ☆27 · Apr 13, 2026 · Updated last week
- A Clone-Based Approach for Recommending Modification on Pasted Code · ☆12 · Jun 10, 2017 · Updated 8 years ago
- Pattern Fuzzing for Worst-Case Algorithmic Complexity using Program Synthesis · ☆20 · Aug 24, 2021 · Updated 4 years ago
- ☆12 · Jul 21, 2023 · Updated 2 years ago
- Structure-Invariant Testing for Machine Translation [ICSE'20] · ☆16 · Dec 17, 2020 · Updated 5 years ago
- Collect simple coverage information in memory. · ☆11 · Oct 6, 2022 · Updated 3 years ago
- Code release of the paper "Guiding Deep Learning System Testing using Surprise Adequacy" · ☆50 · May 26, 2022 · Updated 3 years ago
- ☆19 · Jun 25, 2025 · Updated 9 months ago
- Indexing reachability for context-sensitive data flow analysis. · ☆12 · Jul 10, 2022 · Updated 3 years ago
- Implementation repository of the ESEC/FSE 2020 paper: Deep Learning Library Testing via Effective Model Generation. · ☆56 · Oct 29, 2023 · Updated 2 years ago
- TVMFuzz: fuzzing tensor-level intermediate representation in TVM · ☆32 · May 24, 2020 · Updated 5 years ago
- Scripts for testing TiDB · ☆10 · Feb 4, 2026 · Updated 2 months ago
- This repository contains the implementation and the evaluation of our ESEC/FSE 2020 paper: Detecting Numerical Bugs in Neural Network Ar… · ☆25 · Dec 17, 2020 · Updated 5 years ago
- Automated DNN generation for fuzz testing and more · ☆147 · Jan 14, 2025 · Updated last year
- A random Solidity program generator. · ☆132 · Jan 4, 2026 · Updated 3 months ago
- Checks the PDFs submitted to a conference, e.g., for formatting violations and double-anonymous violations · ☆67 · Dec 11, 2021 · Updated 4 years ago
- GenCoG: A DSL-Based Approach to Generating Computation Graphs for TVM Testing (ISSTA'23) · ☆17 · Jul 19, 2023 · Updated 2 years ago
- Concolic Testing for Deep Neural Networks · ☆118 · Jul 16, 2021 · Updated 4 years ago
- ☆23 · Mar 20, 2021 · Updated 5 years ago
- A list of bugs found by SQLancer · ☆17 · Jan 30, 2024 · Updated 2 years ago
- ☆15 · Jan 23, 2020 · Updated 6 years ago
- A model checker and assume/guarantee contract generator for Lustre programs. · ☆15 · Jun 5, 2018 · Updated 7 years ago
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts · ☆35 · Jul 2, 2024 · Updated last year
- Characterizing Transaction-Reverting Statements in Ethereum Smart Contracts. · ☆11 · Sep 1, 2021 · Updated 4 years ago
- An experimental framework for temporal verification based on first-order linear-time temporal logic. Our goal is to express transition sy… · ☆22 · Mar 29, 2026 · Updated 3 weeks ago
- ☆24 · Oct 31, 2021 · Updated 4 years ago
- ☆11 · Jan 8, 2025 · Updated last year
- Fast and Precise On-the-fly Patch Validation for All · ☆10 · Feb 24, 2023 · Updated 3 years ago
- Lifting network implementation to precise format specification · ☆23 · Apr 21, 2025 · Updated 11 months ago
- WhiteFox: White-Box Compiler Fuzzing Empowered by Large Language Models (OOPSLA 2024) · ☆82 · Aug 5, 2025 · Updated 8 months ago
- Static program analysis framework for Ethereum smart contract bytecode. · ☆169 · Apr 13, 2026 · Updated last week
- Robustness benchmark for DNN models. · ☆66 · Aug 8, 2022 · Updated 3 years ago
- A project to compute all kinds of descriptors for software products (e.g. LOC, McCabe, Halstead). · ☆11 · Mar 20, 2017 · Updated 9 years ago
- Scalable Meta-Evaluation of LLMs as Evaluators · ☆43 · Feb 15, 2024 · Updated 2 years ago
- Deepmark AI enables a unique testing environment for language models (LLM) assessment on task-specific metrics and on your own data so yo… · ☆104 · Nov 24, 2023 · Updated 2 years ago
- A Static Differential Analysis Tool of Network Protocol Parsers · ☆28 · Feb 21, 2024 · Updated 2 years ago