sani903 / OpenAgentSafetyLinks
A Framework for Evaluating AI Agent Safety in Realistic Environments
☆28Updated 3 months ago
Alternatives and similar repositories for OpenAgentSafety
Users that are interested in OpenAgentSafety are comparing it to the libraries listed below
Sorting:
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"☆13Updated last year
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆19Updated 9 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆17Updated 3 months ago
- ☆29Updated last month
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆22Updated 7 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆17Updated 10 months ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Updated 4 months ago
- ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL (ICLR 2025 Pytorch Code)☆17Updated 8 months ago
- ☆16Updated last year
- Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups☆50Updated last year
- Control LLM☆22Updated 9 months ago
- ☆23Updated last year
- ☆52Updated 8 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆13Updated 7 months ago
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation☆16Updated last year
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion☆13Updated 10 months ago
- ☆51Updated 8 months ago
- CS194-196 Course Project☆14Updated 11 months ago
- ☆14Updated last year
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆32Updated 5 months ago
- ☆84Updated last year
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆29Updated 3 months ago
- Code and Data for "FaithfulRAG: Fact-Level Conflict Modeling for Context-Faithful Retrieval-Augmented Generation" (ACL25)☆28Updated 3 months ago
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex…☆40Updated 3 months ago
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆17Updated last year
- ☆30Updated 4 months ago
- MegaRAG: Multimodal Graph-based RAG☆27Updated 4 months ago
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆29Updated 7 months ago
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)☆36Updated 11 months ago