[AAAI 2026] Data and Code for Paper IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks
☆43Nov 24, 2025Updated 5 months ago
Alternatives and similar repositories for IS-Bench
Users who are interested in IS-Bench are comparing it to the libraries listed below.
- All-in-One Safety Evaluation Framework☆48Apr 21, 2026Updated last week
- [NeurIPS 2025 Spotlight] Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning.☆137Mar 31, 2026Updated 3 weeks ago
- [ACL 2025] Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety☆60Jul 21, 2025Updated 9 months ago
- Responsible Robotic Manipulation☆15Aug 31, 2025Updated 7 months ago
- Headway - Selenium Maven TestNG POM Data Driven Framework☆18Jul 2, 2025Updated 9 months ago
- This is the official repository for "SAFE: Multitask Failure Detection for Vision-Language-Action Models" (NeurIPS 2025)☆66Jan 18, 2026Updated 3 months ago
- Imitation learning from multiple experts☆13Aug 29, 2022Updated 3 years ago
- ☆67Jul 14, 2025Updated 9 months ago
- MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation☆70Nov 10, 2025Updated 5 months ago
- Materials related to Peking University's Fall 2024 ICS course☆13May 14, 2025Updated 11 months ago
- The repository of the paper "REEF: Representation Encoding Fingerprints for Large Language Models," aims to protect the IP of open-source…☆78Jan 16, 2025Updated last year
- The official implementation of dLLM-Factory☆25Jul 12, 2025Updated 9 months ago
- ☆30May 22, 2024Updated last year
- A virtual clinical environment for self‑evolving LLM diagnostic agents.☆103Feb 12, 2026Updated 2 months ago
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆13Dec 29, 2024Updated last year
- [ICCV 2025] RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints☆124Sep 2, 2025Updated 7 months ago
- Diagnostic Framework for LLMs and MLLMs☆36Mar 2, 2026Updated last month
- Repo for paper "Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability"☆98Updated this week
- Official code of "UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models" WACV2026☆37Nov 24, 2025Updated 5 months ago
- Advanced Embodied Intelligence Brain Model☆36Nov 5, 2025Updated 5 months ago
- [NeurIPS 2025] Official repository of RiOSWorld: Benchmarking the Risk of Multimodal Computer-Use Agents☆117Dec 2, 2025Updated 4 months ago
- Aims to solve a benchmark problem from the reinforcement learning literature using Deep Q-networks with images as the only input t…☆12Dec 2, 2019Updated 6 years ago
- [Paper][EMNLP 2025] Enrich-on-Graph: Query-Graph Alignment for Complex Reasoning with LLM Enriching☆34Feb 8, 2026Updated 2 months ago
- [ICLR 2026] Unified Vision-Language-Action Model☆296Oct 15, 2025Updated 6 months ago
- An official codebase for "NormLens: Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Comm…☆10May 9, 2024Updated last year
- Code for ACL 2025 Main paper "Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning…☆51Aug 4, 2025Updated 8 months ago
- The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]☆16Sep 12, 2025Updated 7 months ago
- Table top manipulation calibration between the robot arm, the fixed cameras and the camera in hand.☆11Apr 12, 2024Updated 2 years ago
- A vocoder based on PC-DDSP and nsf-HiFiGAN☆18Jul 17, 2023Updated 2 years ago
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆23Mar 4, 2025Updated last year
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay☆158May 29, 2025Updated 11 months ago
- Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence☆61Nov 11, 2025Updated 5 months ago
- This repository compiles a list of papers/resources related to the graph retrieval-augmented generation! Star⭐ the repo and follow me if …☆10Dec 7, 2024Updated last year
- ☆11Nov 23, 2020Updated 5 years ago
- ☆11Sep 1, 2024Updated last year
- A comprehensive framework for benchmarking single and multi-agent systems across a wide range of tasks—evaluating performance, accuracy, …☆38Nov 11, 2025Updated 5 months ago
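- JoinAI is an open-source repository focused on building algorithm engineering skills, including organized notes on engineering and mathematical principles☆11Apr 20, 2025Updated last year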
- Mujoco model of the Kinova Gen3 robot☆21Feb 25, 2025Updated last year
- ☆25Dec 10, 2021Updated 4 years ago