THUNLP-MT / EscapeCraft
Official repo for EscapeCraft (a 3D environment for room escape) and the MM-Escape benchmark
☆15 Updated last week
Alternatives and similar repositories for EscapeCraft
Users who are interested in EscapeCraft are comparing it to the repositories listed below.
- ☆24 Updated 3 months ago
- ☆41 Updated this week
- ☆74 Updated last year
- Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual in… ☆126 Updated this week
- TimeChat-online: 80% Visual Tokens are Naturally Redundant in Streaming Videos ☆42 Updated 2 weeks ago
- [Blog 1] Recording a bug of grpo_trainer in some R1 projects ☆20 Updated 3 months ago
- A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models. ☆63 Updated 2 months ago
- A Self-Training Framework for Vision-Language Reasoning ☆80 Updated 4 months ago
- Official repository of MMDU dataset ☆91 Updated 8 months ago
- Collections of Papers and Projects for Multimodal Reasoning. ☆105 Updated last month
- [LLaVA-Video-R1]✨First Adaptation of R1 to LLaVA-Video (2025-03-18) ☆28 Updated 3 weeks ago
- VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models ☆63 Updated 10 months ago
- ☆101 Updated last month
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models ☆85 Updated 9 months ago
- code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization" ☆55 Updated 9 months ago
- Envolving Temporal Reasoning Capability into LMMs via Temporal Consistent Reward ☆35 Updated 2 months ago
- (ICLR 2025 Spotlight) Official code repository for Interleaved Scene Graph. ☆22 Updated 4 months ago
- ⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning. ☆157 Updated 2 weeks ago
- ☆147 Updated 7 months ago
- The official implementation of "Grounded Chain-of-Thought for Multimodal Large Language Models" ☆11 Updated 2 months ago
- ☆14 Updated last year
- Official implementation of MIA-DPO