shengyin1224 / SafeAgentBench
Code for the paper "SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents"
☆62 Updated 11 months ago
Alternatives and similar repositories for SafeAgentBench
Users who are interested in SafeAgentBench compare it to the repositories listed below.
- Official repo of Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics ☆63 Updated 5 months ago
- [NeurIPS 2025 Spotlight] Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning. ☆108 Updated 2 weeks ago
- This is the official repository for the ICLR 2025 accepted paper Badrobot: Manipulating Embodied LLMs in the Physical World. ☆40 Updated 7 months ago
- Official Implementation of FLARE (AAAI'25 Oral) ☆28 Updated 2 months ago
- Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld ☆62 Updated last year
- Focused on the safety and security of Embodied AI ☆93 Updated last month
- LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents (ICLR 2024) ☆86 Updated 7 months ago
- ☆21 Updated 6 months ago
- HAZARD challenge ☆37 Updated 9 months ago
- Embodied Agent Interface (EAI): Benchmarking LLMs for Embodied Decision Making (NeurIPS D&B 2024 Oral) ☆278 Updated 10 months ago
- Responsible Robotic Manipulation ☆15 Updated 5 months ago
- [ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents. ☆260 Updated 3 months ago
- ICLR 2025 Agent-Related Papers ☆75 Updated last year
- A toolbox for benchmarking Multimodal LLM Agents trustworthiness across truthfulness, controllability, safety and privacy dimensions thro… ☆62 Updated 3 weeks ago
- ☆51 Updated 11 months ago
- [ICLR 2025] Official codebase for the ICLR 2025 paper "Multimodal Situational Safety" ☆30 Updated 7 months ago
- Evaluate Multimodal LLMs as Embodied Agents ☆57 Updated 11 months ago
- Benchmarking Physical Risk Awareness of Foundation Model-based Embodied AI Agents ☆23 Updated last year
- A vision-language-safety action architecture, named AEGIS, which contains a plug-and-play safety constraint layer formulated via control … ☆50 Updated last month
- [ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models ☆211 Updated 10 months ago
- [AAAI 2026] Data and Code for Paper IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks ☆39 Updated 2 months ago
- Official Implementation of ReALFRED (ECCV'24) ☆44 Updated last year
- [ICLR 2024] Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models" ☆291 Updated 10 months ago
- [ACL 2025] "World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning." https://arxiv.org/abs/2503.1… ☆16 Updated 6 months ago
- [ACL 2025] Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety ☆53 Updated 6 months ago
- [CVPR2024] This is the official implementation of MP5 ☆106 Updated last year
- Official code for the paper: WALL-E: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents ☆55 Updated last month
- ☆133 Updated last year
- ProgPrompt for Virtualhome ☆146 Updated 2 years ago
- [ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for Reinforcement Learning ☆195 Updated last year