Awesome material(papers, tools, etc.) about testing machine learning system, including deep learning system.
☆47Oct 12, 2021Updated 4 years ago
Alternatives and similar repositories for awesome-ml-testing
Users that are interested in awesome-ml-testing are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- EvalDNN: A Toolbox for Evaluating Deep Neural Network Models☆14Mar 9, 2020Updated 6 years ago
- Taxonomy of Real Faults in Deep Learning Systems☆15Jan 27, 2020Updated 6 years ago
- A program analysis, verification, and optimization framework☆30Jun 22, 2026Updated last week
- PL/SE conference deadline countdowns☆18Nov 23, 2020Updated 5 years ago
- A Clone-Based Approach for Recommending Modification on Pasted Code☆12Jun 10, 2017Updated 9 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Pattern Fuzzing for Worst-Case Algorithmic Complexity using Program Synthesis☆20Aug 24, 2021Updated 4 years ago
- ☆12Jul 21, 2023Updated 2 years ago
- LLM evaluation.☆16Nov 7, 2023Updated 2 years ago
- Structure-Invariant Testing for Machine Translation [ICSE'20]☆16Dec 17, 2020Updated 5 years ago
- ☆19Jun 25, 2025Updated last year
- Indexing reachability for context-sensitive data flow analysis.☆12Jul 10, 2022Updated 3 years ago
- Java Ranger is a path-merging extension of Symbolic PathFinder☆16Mar 16, 2026Updated 3 months ago
- A benchmark suite (under construction) for smart contract vulnerability tools☆17Jul 13, 2021Updated 4 years ago
- A fuzzer for SMT solvers☆21May 8, 2026Updated last month
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This is the implement repository of our upcoming ESEC/FSE 2020 paper: Deep Learning Library Testing via Effective Model Generation.☆56Oct 29, 2023Updated 2 years ago
- TVMFuzz: fuzzing tensor-level intermediate representation in TVM☆32May 24, 2020Updated 6 years ago
- This repository contains the implementation and the evaluation of our ESEC/FSE 2020 paper: Detecting Numerical Bugs in Neural Network Ar…☆25Dec 17, 2020Updated 5 years ago
- Automated DNN generation for fuzz testing and more☆149Jan 14, 2025Updated last year
- 北大树洞爬虫☆11Jul 30, 2020Updated 5 years ago
- Guiding Program Synthesis by Learning to Generate Examples☆13Jul 23, 2023Updated 2 years ago
- A random Solidity program generator.☆134Jan 4, 2026Updated 5 months ago
- Checks the PDFs submitted to a conference, e.g., for formatting violations and double anonymous violations☆69Dec 11, 2021Updated 4 years ago
- GenCoG: A DSL-Based Approach to Generating Computation Graphs for TVM Testing (ISSTA‘23)☆17Jul 19, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Concolic Testing for Deep Neural Networks☆118Jul 16, 2021Updated 4 years ago
- A complete guide to evaluate LLMs and RAGs. Both theory and code based approaches covered.☆28Nov 16, 2023Updated 2 years ago
- ☆23Mar 20, 2021Updated 5 years ago
- ☆15Jan 23, 2020Updated 6 years ago
- A model checker and assume/guarantee contract generator for Lustre programs.☆15Jun 5, 2018Updated 8 years ago
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts☆36Jul 2, 2024Updated last year
- Characterizing Transaction-Reverting Statements in Ethereum Smart Contracts.☆11Sep 1, 2021Updated 4 years ago
- Analyze execution trace to find regression bug☆41Jun 2, 2024Updated 2 years ago
- Libra is a static analyzer for certifying fairness of feed-forward neural network classifiers of tabular data.☆24Oct 31, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Fast and Precise On-the-fly Patch Validation for All☆10Feb 24, 2023Updated 3 years ago
- Lifting network implementation to precise format specification☆23Apr 21, 2025Updated last year
- WhiteFox: White-Box Compiler Fuzzing Empowered by Large Language Models (OOPSLA 2024)☆83Aug 5, 2025Updated 10 months ago
- Static program analysis framework for Ethereum smart contract bytecode.☆168Apr 13, 2026Updated 2 months ago
- ☆10Dec 13, 2021Updated 4 years ago
- Website for Learning from "Big Code"☆30Jun 19, 2021Updated 5 years ago
- Artifact for IEEE Security and Privacy 2022 paper: "SoK: Demystifying Binary Lifters Through the Lens of Downstream Applications"☆29Jul 29, 2022Updated 3 years ago