tsinghua-fib-lab / World-ModelView external linksLinks
[ACM CSUR 2025] Understanding World or Predicting Future? A Comprehensive Survey of World Models
☆467Nov 18, 2025Updated 2 months ago
Alternatives and similar repositories for World-Model
Users that are interested in World-Model are comparing it to the libraries listed below
Sorting:
- The official implementation of the manuscript Learning the complexity of urban mobility with deep generative collaboration network.☆17Jan 23, 2024Updated 2 years ago
- Official implementation of "Diffusion Language Models Know the Answer Before Decoding"☆45Sep 8, 2025Updated 5 months ago
- ☆11Aug 17, 2025Updated 5 months ago
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆13Dec 29, 2024Updated last year
- Learning Dynamic Generator Model by Alternating Back-Propagation Through Time☆11Dec 26, 2018Updated 7 years ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆34Jan 16, 2026Updated last month
- DreamGaussian with 2D-GS☆12Oct 10, 2024Updated last year
- (ICML 2025) Rethinking Chain-of-Thought from the Perspective of Self-Training☆13Feb 15, 2025Updated last year
- A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and A…☆1,224Updated this week
- [KDD 2025] MM-Path: Multi-modal, Multi-granularity Path Representation Learning.☆16Jan 9, 2025Updated last year
- Official Implementation of Paper: WMPO: World Model-based Policy Optimization for Vision-Language-Action Models☆150Jan 4, 2026Updated last month
- The official implementation of "LightTransfer: Your Long-Context LLM is Secretly a Hybrid Model with Effortless Adaptation"☆22Apr 22, 2025Updated 9 months ago
- Evaluation toolkit of the informative tracking benchmark comprising 9 scenarios, 180 diverse videos, and new challenges.☆17Dec 14, 2021Updated 4 years ago
- [WWW 2025] The dataset of the paper "A Large-scale Dataset with Behavior, Attributes, and Content of Mobile Short-video Platform"☆50Jan 29, 2026Updated 2 weeks ago
- Hands-On Image Processing with Python, Second Edition, Published by Packt☆26Updated this week
- ☆17Aug 6, 2023Updated 2 years ago
- a naive 3d human pose editor GUI.☆20Jul 12, 2023Updated 2 years ago
- ☆21Apr 16, 2024Updated last year
- YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection☆20Mar 4, 2025Updated 11 months ago
- ☆21May 3, 2025Updated 9 months ago
- ☆18Updated this week
- ☆496Oct 30, 2025Updated 3 months ago
- Sys2Bench is a benchmarking suite designed to evaluate reasoning and planning capabilities of large language models across algorithmic, l…☆29Mar 5, 2025Updated 11 months ago
- [ECCV 2024] PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance☆23Jul 25, 2024Updated last year
- [ECCV2024] Vista3D: Unravel the 3D Darkside of a Single Image☆55Sep 19, 2024Updated last year
- [arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies☆59Feb 6, 2026Updated last week
- [ICML 2025] Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment (https://arxiv.org/abs/2410.02197)☆39Sep 8, 2025Updated 5 months ago
- Encourage Medical LLM to engage in deep thinking similar to DeepSeek-R1.☆26Apr 24, 2025Updated 9 months ago
- A Text2SQL benchmark for evaluation of Large Language Models☆41Feb 8, 2026Updated last week
- VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning☆35Jul 15, 2025Updated 7 months ago
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Aug 20, 2025Updated 5 months ago
- ☆38Jan 8, 2026Updated last month
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆13Jun 28, 2025Updated 7 months ago
- MotionSight's official code implementation.☆45Sep 26, 2025Updated 4 months ago
- ☆178Oct 22, 2024Updated last year
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆32Jan 23, 2025Updated last year
- [AAAI 2026] GenMAC for Compositional Text-to-Video Generation☆32Jan 10, 2026Updated last month
- A curated list of awesome papers for reconstructing 4D spatial intelligence from video. (arXiv 2507.21045)☆436Jan 23, 2026Updated 3 weeks ago
- Collect some World Models for Autonomous Driving (and Robotic, etc.) papers.☆1,837Feb 1, 2026Updated 2 weeks ago