shengyin1224 / SafeAgentBench
Code for the paper "SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents"
☆25Updated last month
Alternatives and similar repositories for SafeAgentBench:
Users who are interested in SafeAgentBench are comparing it to the libraries listed below
- Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld☆55Updated 6 months ago
- ☆27Updated 3 weeks ago
- Official repo of Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics☆19Updated 2 weeks ago
- Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety☆35Updated 3 weeks ago
- ICLR 2025 Agent-Related Papers☆59Updated 4 months ago
- ☆43Updated 2 months ago
- Official Implementation of ReALFRED (ECCV'24)☆39Updated 5 months ago
- Embodied Agent Interface (EAI): Benchmarking LLMs for Embodied Decision Making (NeurIPS D&B 2024 Oral)☆183Updated last month
- LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents (ICLR 2024)☆69Updated 3 weeks ago
- HAZARD challenge☆29Updated 3 weeks ago
- [ICLR 2025] Official codebase for the ICLR 2025 paper "Multimodal Situational Safety"☆13Updated last month
- ☆126Updated 8 months ago
- Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.☆192Updated last week
- This is the official repository for the ICLR 2025 accepted paper Badrobot: Manipulating Embodied LLMs in the Physical World.☆18Updated last month
- [arXiv 2023] Embodied Task Planning with Large Language Models☆179Updated last year
- Official code release of AAAI 2024 paper SayCanPay.☆46Updated last year
- ProgPrompt for Virtualhome☆132Updated last year
- Official Implementation of CL-ALFRED (ICLR'24)☆22Updated 5 months ago
- ☆31Updated 5 months ago
- RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints☆32Updated last week
- An implementation for MLLM oversensitivity evaluation☆13Updated 4 months ago
- Code for the paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning☆39Updated last year
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization☆102Updated this week
- [CVPR 2024] This is the official implementation of MP5☆99Updated 9 months ago
- Summaries of ICML 2024 papers☆10Updated 8 months ago
- Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"☆29Updated last month
- Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.☆100Updated this week
- [ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"☆154Updated 3 months ago
- [ICML 2024] The official implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"☆79Updated 6 months ago
- All you need about Robotics and AI Agents is here☆28Updated 11 months ago