THUNLP-MT/EscapeCraft


Official repo for EscapeCraft (a 3D environment for room escape) and the MM-Escape benchmark. This work was accepted to ICCV 2025.

☆39

Alternatives and similar repositories for EscapeCraft

Users who are interested in EscapeCraft are comparing it to the repositories listed below.


THUNLP-MT / CODIS
Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".
☆13 · Oct 14, 2024 · Updated last year
PVIT-official / PVIT
Repository of paper: Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models
☆37 · Sep 19, 2023 · Updated 2 years ago
Tsinghua-dhy / EDC-2-RAG
☆19 · Nov 3, 2025 · Updated 8 months ago
THUNLP-MT / MUSEG
Repo for paper "MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding".
☆40 · Jun 9, 2025 · Updated last year
THUNLP-MT / ModelCompose
Official code for our paper "Model Composition for Multimodal Large Language Models" (ACL 2024)
☆31 · Jan 8, 2025 · Updated last year
thunlp / LEGENT
Open Platform for Embodied Agents
☆342 · Jan 12, 2025 · Updated last year
neu-vi / FleVRS
FleVRS: Towards Flexible Visual Relationship Segmentation, NeurIPS 2024
☆22 · Dec 9, 2024 · Updated last year
Graphic-Kiliani / M3DLayout-code
[CVPR 2026 Highlight] M3DLayout: A Multi-Source Dataset of 3D Indoor Layouts and Structured Descriptions.
☆60 · Apr 13, 2026 · Updated 3 months ago
snumprlab / isr-dpo
Official Implementation of ISR-DPO: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO (AAAI'25)
☆23 · Nov 25, 2025 · Updated 7 months ago
AgentForceTeamOfficial / UA2-Agent
Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme…
☆19 · Nov 12, 2024 · Updated last year
ars22 / e3
☆20 · Sep 16, 2025 · Updated 10 months ago
yiwei-hu / DiffProxy
Code for the paper "Node Graph Optimization Using Differentiable Proxies"
☆15 · Dec 21, 2022 · Updated 3 years ago
adobe-research / ProcMatRL
☆15 · Feb 24, 2025 · Updated last year
TingtingLiao / unique3d_diffuser
☆16 · Sep 30, 2024 · Updated last year
neu-vi / struct2d
Code release for 'Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs' (NeurIPS 2025)
☆31 · Oct 28, 2025 · Updated 8 months ago
richard-guyunqi / BlenderGym-Open
☆40 · Jul 8, 2025 · Updated last year
THUNLP-MT / SKR
Self-Knowledge Guided Retrieval Augmentation for Large Language Models (EMNLP Findings 2023)
☆27 · Dec 8, 2023 · Updated 2 years ago
UCSB-AI / MMWorld
Official repo of the ICLR 2025 paper "MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos"
☆28 · Jul 15, 2025 · Updated last year
THUNLP-MT / ActiView
☆11 · Dec 20, 2024 · Updated last year
WadeYin9712 / UI-Simulator
Code for 🌍 UI-Simulator: LLMs as Scalable, General-Purpose Simulators For Evolving Digital Agent Training
☆21 · Oct 17, 2025 · Updated 9 months ago
linjh1118 / WisdoMentor
WisdoMentor Series: An LLM for undergraduates | 博导智言 (a study aid for university students)
☆13 · May 9, 2024 · Updated 2 years ago
tsinghua-fib-lab / RoboScape
☆26 · Jun 29, 2025 · Updated last year
TsinghuaJunYin / FloorPlan-LLaMa
☆24 · May 26, 2025 · Updated last year
princetonvisualai / imagecaptioning-bias
Code for the paper "Understanding and Evaluating Racial Biases in Image Captioning"
☆12 · Mar 26, 2026 · Updated 3 months ago
linjh1118 / Llama3-Chinese-ORPO
A Chinese version of Llama3, obtained from Llama3 through further CPT, SFT, and ORPO
☆16 · Apr 24, 2024 · Updated 2 years ago
GigaAI-research / WonderFree
☆19 · Jun 26, 2025 · Updated last year
ByungKi-K / JointDiT-code
JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers
☆17 · Jul 21, 2025 · Updated last year
NEUIR / MemGraph
[SIGIR '25] This is the code repo for our SIGIR '25 paper: Enhancing the Patent Matching Capability of Large Language Models via Memory G…
☆19 · Apr 22, 2025 · Updated last year
cvlab-kaist / URECA
Official implementation of "URECA: Unique Region Caption Anything"
☆58 · Jul 13, 2025 · Updated last year
LightChen233 / M3CoT
☆92 · Mar 12, 2026 · Updated 4 months ago
linjh1118 / Chinese_Awesome_CV
Chinese version of Awesome_CV; clone this project to Overleaf to write your own CV easily and enjoyably
☆18 · May 24, 2024 · Updated 2 years ago
yonseivnl / vlm-rlaif
ACL'24 (Oral) Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback
☆77 · Sep 12, 2024 · Updated last year
Xiaohan-Chen / eat_pytorch_in_20_days
Pytorch🍊🍉 is delicious, just eat it! 😋😋
☆10 · Feb 13, 2026 · Updated 5 months ago
fdyuandong / ReplicaPano-Dataset
This repo contains visualization code of our ReplicaPano Dataset.
☆19 · Feb 7, 2025 · Updated last year
whiteinblue / EarthCrafter
☆40 · Mar 17, 2026 · Updated 4 months ago
THUNLP-MT / StreamingBench
StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding
☆167 · May 16, 2025 · Updated last year
Heng14 / DyLiN
Source code for CVPR 2023 DyLiN paper
☆22 · Dec 14, 2023 · Updated 2 years ago
yuhui-zh15 / AutoConverter
Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202…
☆40 · May 26, 2025 · Updated last year
Carlos-Mero / HaiShangHua
A visual novel made with Godot Engine.
☆11 · Sep 18, 2023 · Updated 2 years ago