mingyin1/Agents_Failure_Attribution

mingyin1 / Agents_Failure_Attribution

Benchmark for automated failure attributions in agentic systems (🏆 ICML 2025 Spotlight)

☆24

Alternatives and similar repositories for Agents_Failure_Attribution

Users who are interested in Agents_Failure_Attribution are comparing it to the repositories listed below.

TraceElephant / TraceElephant
Repo of "Seeing the Whole Elephant: A Benchmark for Failure Attribution in LLM-based Multi-Agent Systems" (ACL 2026)
☆16 • Apr 27, 2026 • Updated 2 months ago
JinLi-i / MoDiCF
The source code of [WWW 2025] MoDiCF
☆16 • Mar 26, 2026 • Updated 3 months ago
stefanhgm / patient_summaries_with_llms
Code for "A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language Models"
☆17 • Jul 20, 2025 • Updated 11 months ago
zzh-thu-22 / ExtendAttack
[AAAI 2026] This is the official implementation of the paper "ExtendAttack: Attacking Servers of LRMs via Extending Reasoning".
☆25 • Mar 18, 2026 • Updated 3 months ago
RulinShao / RAG-evaluation-harnesses
An evaluation suite for Retrieval-Augmented Generation (RAG).
☆24 • Apr 26, 2025 • Updated last year
ncsu-dk-lab / Acc-DD
☆14 • Apr 21, 2023 • Updated 3 years ago
MadryLab / D3M
Debiasing Through Data Attribution
☆13 • May 23, 2024 • Updated 2 years ago
wbopan / flashtrace
Efficient multi-token attribution for reasoning language models — Python package, CLI, and HTML token traces
☆29 • Jul 3, 2026 • Updated last week
CogComp / faithful_summarization
☆18 • May 5, 2021 • Updated 5 years ago
Lichang-Chen / AlpaGasus
A better Alpaca Model Trained with Less Data (only 9k instructions of the original set)
☆24 • Jul 26, 2024 • Updated last year
ZNLP / Language-Imbalance-Driven-Rewarding
[ICLR 2025] Language Imbalance Driven Rewarding for Multilingual Self-improving
☆25 • Apr 6, 2026 • Updated 3 months ago
Percent-BFD / neurips_submission
☆17 • Nov 23, 2023 • Updated 2 years ago
xxiqiao / TROJail
Official implementation of "TROJail: Trajectory-Level Optimization for Multi-Turn Large Language Model Jailbreaks with Process Rewards"
☆30 • Updated this week
WikiChao / DAVIS
[🏆 IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official pytorch implementation of the paper "High-Quality Visually-Guided Sound …
☆33 • Mar 30, 2026 • Updated 3 months ago
Secbrain / RIDS
☆11 • Oct 7, 2023 • Updated 2 years ago
solislemuslab / tropical-stethoscope
Classification of animal sounds in a hyperdiverse rainforest using Convolutional Neural Networks (Sun et al, 2021)
☆13 • Oct 16, 2023 • Updated 2 years ago
Tongsuo-Project / tsapp
A desktop commercial-cryptography toolbox application built on the Tongsuo cryptographic library
☆12 • Feb 5, 2025 • Updated last year
UCSC-REAL / FLAT
[ICLR 2025] FLAT: LLM Unlearning via Loss Adjustment with Only Forget Data
☆14 • Feb 26, 2025 • Updated last year
caiqizh / LUQ
☆14 • Jan 14, 2026 • Updated 6 months ago
peng-gao-lab / p4control
P4Control: Line-Rate Cross-Host Attack Prevention via In-Network Information Flow Control Enabled by Programmable Switches and eBPF
☆11 • May 20, 2024 • Updated 2 years ago
Zhudongsheng75 / Divide-Then-Aggregate
(ACL 2025) Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation
☆12 • May 21, 2025 • Updated last year
SoonyangZhang / tcp-congestion-mininet
Test TCP congestion fairness on Mininet
☆10 • Aug 18, 2020 • Updated 5 years ago
XMUDeepLIT / TTCS
The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving.
☆50 • Apr 22, 2026 • Updated 2 months ago
HReynaud / EchoNet-Synthetic
MICCAI 2024 code for the paper: EchoNet-Synthetic: Privacy-preserving Video Generation for Safe Medical Data Sharing. EchoNet-Synthetic i…
☆41 • Jun 16, 2025 • Updated last year
MraDonkey / DMAD
[ICLR 2025] Breaking Mental Set to Improve Reasoning through Diverse Multi-Agent Debate
☆25 • Apr 22, 2025 • Updated last year
2654400439 / H123-Website-Fingerprinting
The code and dataset for the paper HOLMES & WATSON: A Robust and Lightweight HTTPS Website Fingerprinting through HTTP Version Parallelis…
☆16 • May 30, 2025 • Updated last year
ZBox1005 / AgentForesight
AgentForesight: Online Auditing for Early Failure Prediction in Multi-Agent Systems
☆15 • May 12, 2026 • Updated 2 months ago
xiaohanzhang2005 / Minor-Detection
Self-evolving minor-user identification agent for anthropomorphic AI interaction, with trigger evaluation, evidence chains, and deployabl…
☆20 • Apr 13, 2026 • Updated 3 months ago
kennethorq / SMORE
[WSDM 2025] Source code for "Spectrum-based Modality Representation Fusion Graph Convolutional Network for Multimodal Recommendation".
☆36 • Dec 22, 2024 • Updated last year
qing-yuan233 / RMCBench
Benchmarking Large Language Models' Resistance to Malicious Code
☆19 • Apr 23, 2026 • Updated 2 months ago
Wangyuhao06 / IKEA
Implementation of Implicit Knowledge Extraction Attack.
☆24 • Apr 17, 2026 • Updated 2 months ago
juangamella / icp
Python implementation of the Invariant Causal Prediction (ICP) algorithm, from the 2015 paper "Causal inference using invariant predictio…
☆26 • Feb 15, 2024 • Updated 2 years ago
TURuibo / Neuropathic-Pain-Diagnosis-Simulator
Neuropathic Pain Diagnosis Simulator
☆14 • Jul 6, 2023 • Updated 3 years ago
levyisthebest / ECHOPulse_Prelease
The pre-release GitHub repository of ECHOPULSE: ECG CONTROLLED ECHOCARDIOGRAMS VIDEO GENERATION
☆47 • Feb 4, 2025 • Updated last year
tmlr-group / TriMem
[arXiv:2605.19952] "Rethinking How to Remember: Beyond Atomic Facts in Lifelong LLM Agent Memory"
☆16 • May 20, 2026 • Updated last month
Shangshu-LAB / MM4flow
☆22 • Jan 19, 2026 • Updated 5 months ago
rxtan2 / AVSeT
☆17 • Oct 2, 2023 • Updated 2 years ago
zehao-dong / PACE
☆18 • Dec 30, 2023 • Updated 2 years ago
collinzrj / adversarial_decoding
☆29 • Oct 27, 2025 • Updated 8 months ago