Official repo for EscapeCraft (an 3D environment for room escape) and benchmark MM-Escape. This work is accepted by ICCV 2025.
☆36Jul 7, 2025Updated 7 months ago
Alternatives and similar repositories for EscapeCraft
Users that are interested in EscapeCraft are comparing it to the libraries listed below
Sorting:
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".☆12Oct 14, 2024Updated last year
- Repository of paper: Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models☆37Sep 19, 2023Updated 2 years ago
- Official code for our paper "Model Composition for Multimodal Large Language Models" (ACL 2024)☆31Jan 8, 2025Updated last year
- ☆28Jul 14, 2025Updated 7 months ago
- For Ego4D VQ3D Task☆22Jan 9, 2024Updated 2 years ago
- [ICCV 2025] ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models☆49Jul 7, 2025Updated 7 months ago
- ☆18Jun 10, 2025Updated 8 months ago
- ☆73Feb 14, 2026Updated 2 weeks ago
- ☆10Jun 2, 2022Updated 3 years ago
- On the Robustness of GUI Grounding Models Against Image Attacks☆12Apr 8, 2025Updated 10 months ago
- ☆12Updated this week
- WisdoMentor - Series: A LLM for undergraduates | 博导智言(辅助大学生 学习)☆13May 9, 2024Updated last year
- [ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…☆42May 3, 2025Updated 9 months ago
- Official implementation of Recurrent Action Transformer with Memory, an offline RL agent with memory mechanisms. https://sites.google.com…☆18Nov 23, 2025Updated 3 months ago
- ☆12Dec 20, 2024Updated last year
- Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆15Nov 18, 2025Updated 3 months ago
- PyTorch Implementation of "Mask2Hand: Learning to Predict the 3D Hand Pose and Shape from Shadow"☆12Aug 19, 2024Updated last year
- Awesome latest models, datasets and benchmarks on streaming/online video understanding.☆23Oct 19, 2025Updated 4 months ago
- ☆18Mar 2, 2025Updated 11 months ago
- CArbohydrate-Protein Site IdentiFier☆15Aug 22, 2023Updated 2 years ago
- [ICLR2025] Are Large Vision Language Models Good Game Players?☆12Mar 3, 2025Updated 11 months ago
- Inverse Kinematics for MANO hands☆18Feb 23, 2022Updated 4 years ago
- ☆12Nov 2, 2021Updated 4 years ago
- Code for training a language model reaction predictor. (To accompany our paper on the OOD evaluation of reaction predictors).☆12Jan 13, 2025Updated last year
- ☆12May 19, 2021Updated 4 years ago
- ☆13Jul 22, 2022Updated 3 years ago
- Code accompanying our ICML 2020 paper on choice set optimization in group decision-making.☆11Jun 27, 2020Updated 5 years ago
- Code for paper "Rethinking Text-based Protein Understanding: Retrieval or LLM?"☆18Oct 7, 2025Updated 4 months ago
- The official implementation of the paper SAEdit: Token-level control for continuous image editing via Sparse AutoEncoder☆18Oct 19, 2025Updated 4 months ago
- A LLM-powered agent for NetHack☆19Nov 4, 2024Updated last year
- ☆16Oct 13, 2025Updated 4 months ago
- [IEEE TMI'24] Self-Supervised Cyclic Diffeomorphic Mapping for Soft Tissue Deformation Recovery in Robotic Surgery Scenes☆10Aug 10, 2024Updated last year
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Sep 21, 2025Updated 5 months ago
- A fast CUDA accelerated implementation for MVS evaluation.☆12Dec 1, 2022Updated 3 years ago
- Official Repository of LatentSeek☆77Jun 6, 2025Updated 8 months ago
- An undergraduate thesis project.☆11Jul 13, 2024Updated last year
- ☆13Jun 9, 2020Updated 5 years ago
- ☆11Feb 5, 2024Updated 2 years ago
- ☆14May 20, 2025Updated 9 months ago