thuml / RLVR-WorldLinks
Official repository for "RLVR-World: Training World Models with Reinforcement Learning", https://arxiv.org/abs/2505.13934
☆70Updated last month
Alternatives and similar repositories for RLVR-World
Users that are interested in RLVR-World are comparing it to the libraries listed below
Sorting:
- ☆47Updated 2 months ago
- ☆69Updated 2 weeks ago
- AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time☆76Updated last month
- ☆109Updated 3 months ago
- Official implementation of paper "ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context Prompting" (CVPR 2025)☆42Updated 3 months ago
- [IROS'25 Oral & NeurIPSw'24] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simula…☆91Updated last month
- [CVPR2024] This is the official implement of MP5☆103Updated last year
- [ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.☆163Updated 3 weeks ago
- A repo for open research on building large reasoning models☆84Updated this week
- ☆50Updated last month
- Paper collections of the continuous effort start from World Models.☆179Updated last year
- Bayes-Adaptive RL for LLM Reasoning☆36Updated 2 months ago
- ☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models☆19Updated 2 months ago
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay☆99Updated 2 months ago
- Code release for "Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning" (NeurIPS 2023), https://ar…☆66Updated 10 months ago
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …☆66Updated 2 months ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning"☆134Updated 2 months ago
- ☆188Updated this week
- [NeurIPS 2024] Official Repository of Multi-Object Hallucination in Vision-Language Models☆29Updated 8 months ago
- ☆41Updated last month
- G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning☆77Updated 2 months ago
- Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens (arXiv 2025)☆115Updated this week
- Evaluate Multimodal LLMs as Embodied Agents☆52Updated 5 months ago
- ☆20Updated 3 weeks ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆105Updated 2 months ago
- ☆45Updated 7 months ago
- This repo contains the code for "MEGA-Bench Scaling Multimodal Evaluation to over 500 Real-World Tasks" [ICLR2025]☆73Updated last month
- Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆40Updated 3 weeks ago
- Multimodal RewardBench☆42Updated 5 months ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆43Updated 3 weeks ago