chanchimin / AgentMonitorLinks
Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"
☆10Updated 7 months ago
Alternatives and similar repositories for AgentMonitor
Users that are interested in AgentMonitor are comparing it to the libraries listed below
Sorting:
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆19Updated 3 weeks ago
- ☆39Updated 5 months ago
- ☆46Updated 2 months ago
- [ACL 2025] Knowledge Unlearning for Large Language Models☆39Updated 2 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆16Updated last week
- ☆30Updated 3 months ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆24Updated last month
- ☆47Updated 5 months ago
- A holistic benchmark for LLM abstention☆38Updated last week
- Open-Source LLM Coders with Co-Evolving Reinforcement Learning☆93Updated this week
- ☆15Updated 10 months ago
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆12Updated 3 weeks ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆26Updated 4 months ago
- ☆13Updated 7 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆64Updated last month
- Resa: Transparent Reasoning Models via SAEs☆39Updated last month
- ☆24Updated 9 months ago
- ☆45Updated last month
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)☆27Updated 4 months ago
- ☆22Updated last year
- ☆36Updated last month
- ☆19Updated 4 months ago
- The official implementation of Preference Data Reward-Augmentation.☆17Updated 2 months ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆46Updated 4 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆15Updated 4 months ago
- ☆20Updated 2 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆27Updated this week
- Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆40Updated last month
- Official implementation of "PyVision: Agentic Vision with Dynamic Tooling."☆24Updated last week
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year