[AAAI 2026] Data and Code for Paper IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks
☆40Nov 24, 2025Updated 3 months ago
Alternatives and similar repositories for IS-Bench
Users who are interested in IS-Bench are comparing it to the libraries listed below
- Benchmarking Physical Risk Awareness of Foundation Model-based Embodied AI Agents☆23Nov 28, 2024Updated last year
- [EMNLP 2025] The code repo of paper "X-Boundary: Establishing Exact Safety Boundary to Shield LLMs from Multi-Turn Jailbreaks without Com…☆39Nov 24, 2025Updated 3 months ago
- [ACL 2025] Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety☆54Jul 21, 2025Updated 7 months ago
- Responsible Robotic Manipulation☆16Aug 31, 2025Updated 6 months ago
- ☆23Oct 30, 2025Updated 4 months ago
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond☆345Jan 22, 2026Updated last month
- MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation☆58Nov 10, 2025Updated 3 months ago
- ☆30May 22, 2024Updated last year
- Official code for paper: Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language☆30Feb 28, 2025Updated last year
- Code for "Adversarial Illusions in Multi-Modal Embeddings"☆31Aug 4, 2024Updated last year
- ☆55Feb 2, 2026Updated 3 weeks ago
- The code for paper entitled "Data-Driven Modulation Optimization with LMMSE Equalization for Reliability Enhancement in Underwater Acoust…☆19Oct 4, 2025Updated 4 months ago
- ☆64Jul 14, 2025Updated 7 months ago
- [ICCV 2025] RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints☆108Sep 2, 2025Updated 5 months ago
- Dynamic, high-resolution poverty measurement in data-scarce environments☆10Dec 8, 2024Updated last year
- A virtual clinical environment for self‑evolving LLM diagnostic agents.☆94Feb 12, 2026Updated 2 weeks ago
- [EMNLP 2025] The official implementation of "Zero-shot Multimodal Document Retrieval via Cross-Modal Question Generation"☆15Aug 26, 2025Updated 6 months ago
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"☆10Jul 1, 2024Updated last year
- Empowering Data Driven insights through hands-on projects, SQL challenges and practical tools.☆24Jan 25, 2026Updated last month
- Corpus to accompany: "Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding"☆11Apr 11, 2025Updated 10 months ago
- Implementation of a recommender system using RQ-VAE Semantic IDs☆16Aug 11, 2025Updated 6 months ago
- ☆24Oct 9, 2025Updated 4 months ago
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆13Dec 29, 2024Updated last year
- Code for the AAAI-accepted paper "Similarity Distribution based Membership Inference Attack on Person Re-Identification"☆11Sep 29, 2024Updated last year
- ☆10Nov 28, 2023Updated 2 years ago
- 🔥 [NeurIPS 2024] A Cat Is A Cat (Not A Dog!): Unraveling Information Mix-ups in Text-to-Image Encoders through Causal Analysis and Embed…☆13Jun 21, 2025Updated 8 months ago
- An official codebase for "NormLens: Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Comm…☆10May 9, 2024Updated last year
- Data oriented programming language for game developers☆23May 4, 2021Updated 4 years ago
- 程序员延寿指南 | A programmer's guide to live longer☆18Jan 30, 2024Updated 2 years ago
- Mine conversations from novels in Project Gutenberg, to generate data for data-driven dialogue systems.☆15May 7, 2019Updated 6 years ago
- [NeurIPS 2025] LabUtopia: High-Fidelity Simulation and Hierarchical Benchmark for Scientific Embodied Agents☆27Updated this week
- ☆31Sep 19, 2025Updated 5 months ago
- The official implementation of dLLM-Factory☆26Jul 12, 2025Updated 7 months ago
- ☆11Sep 1, 2024Updated last year
- ☆24Feb 2, 2026Updated 3 weeks ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆30Dec 22, 2025Updated 2 months ago
- ☆11Nov 30, 2025Updated 3 months ago
- Code for MME-SID accepted to CIKM 2025 Full Research track.☆27Oct 29, 2025Updated 4 months ago
- Towards a Mechanistic Understanding of Large Reasoning Models: A Survey of Training, Inference, and Failures☆30Jan 29, 2026Updated last month