shengyin1224 / SafeAgentBench
Codes for paper "SafeAgentBench: A Benchmark for Safe Task Planning of \\ Embodied LLM Agents"
☆34Updated 2 months ago
Alternatives and similar repositories for SafeAgentBench
Users that are interested in SafeAgentBench are comparing it to the libraries listed below
Sorting:
- Official repo of Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics☆25Updated last month
- Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld☆57Updated 7 months ago
- [ICLR 2025] Official codebase for the ICLR 2025 paper "Multimodal Situational Safety"☆15Updated 2 months ago
- HAZARD challenge☆32Updated 2 weeks ago
- This is the official repository for the ICLR 2025 accepted paper Badrobot: Manipulating Embodied LLMs in the Physical World.☆20Updated 2 months ago
- ☆43Updated 3 months ago
- ☆38Updated last month
- Official Implementation of ReALFRED (ECCV'24)☆39Updated 7 months ago
- ICLR 2025 Agent-Related Papers☆67Updated 6 months ago
- ☆128Updated 10 months ago
- Official code release of AAAI 2024 paper SayCanPay.☆48Updated last year
- Embodied Agent Interface (EAI): Benchmarking LLMs for Embodied Decision Making (NeurIPS D&B 2024 Oral)☆195Updated 2 months ago
- LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents (ICLR 2024)☆70Updated last month
- ☆32Updated 7 months ago
- Official code for the paper: WALL-E: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents☆34Updated last week
- ☆16Updated 5 months ago
- Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety☆37Updated 2 months ago
- ☆29Updated 7 months ago
- Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"☆34Updated 2 months ago
- ☆25Updated 11 months ago
- MuMA-ToM: Multi-modal Multi-Agent Theory of Mind☆26Updated 3 months ago
- Official Implementation of CL-ALFRED (ICLR'24)☆22Updated 6 months ago
- Awesome Large Reasoning Model(LRM) Safety.This repository is used to collect security-related research on large reasoning models such as …☆63Updated this week
- ProgPrompt for Virtualhome☆133Updated last year
- [ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models☆186Updated last month
- [ICML 2024] Language Models Represent Beliefs of Self and Others☆32Updated 7 months ago
- VELMA agent for VLN in Street View☆19Updated last year
- code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning☆40Updated last year
- [CVPR2024] This is the official implement of MP5☆101Updated 10 months ago
- Implementation of the MATRIX framework (ICML 2024)☆51Updated last year