thuml / RLVR-World

Official repository for "RLVR-World: Training World Models with Reinforcement Learning", https://arxiv.org/abs/2505.13934

☆70

Alternatives and similar repositories for RLVR-World

Users who are interested in RLVR-World are comparing it to the repositories listed below.

Gabesarch / ICAL
☆47 Updated 2 months ago
Gabesarch / grounded-rl
☆69 Updated 2 weeks ago
ASTRAL-Group / AlphaOne
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
☆76 Updated last month
DigiRL-agent / digiq
☆109 Updated 3 months ago
CraftJarvis / ROCKET-1
Official implementation of paper "ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context Prompting" (CVPR 2025)
☆42 Updated 3 months ago
Zhoues / MineDreamer
[IROS'25 Oral & NeurIPSw'24] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simula…
☆91 Updated last month
IranQin / MP5
[CVPR 2024] This is the official implementation of MP5
☆103 Updated last year
EmbodiedBench / EmbodiedBench
[ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.
☆163 Updated 3 weeks ago
LLM360 / Reasoning360
A repo for open research on building large reasoning models
☆84 Updated this week
yunfeixie233 / ViGaL
☆50 Updated last month
Timothyxxx / WorldModelPapers
A collection of papers from the ongoing line of work that started with World Models.
☆179 Updated last year
shenao-zhang / BARL
Bayes-Adaptive RL for LLM Reasoning
☆36 Updated 2 months ago
linhaowei1 / kumo
☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models
☆19 Updated 2 months ago
dvlab-research / ARPO
Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay
☆99 Updated 2 months ago
thuml / ContextWM
Code release for "Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning" (NeurIPS 2023), https://ar…
☆66 Updated 10 months ago
USC-GVL / PhysBench
[ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …
☆66 Updated 2 months ago
TIGER-AI-Lab / VL-Rethinker
The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning"
☆134 Updated 2 months ago
ML-GSAI / LLaDA-V
☆188 Updated this week
sled-group / moh
[NeurIPS 2024] Official Repository of Multi-Object Hallucination in Vision-Language Models
☆29 Updated 8 months ago
zhijie-group / R1-Zero-VSI
☆41 Updated last month
chenllliang / G1
G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning
☆77 Updated 2 months ago
UMass-Embodied-AGI / Mirage
Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens (arXiv 2025)
☆115 Updated this week
thunlp / EmbodiedEval
Evaluate Multimodal LLMs as Embodied Agents
☆52 Updated 5 months ago
VisuLogic-Benchmark / VisuLogic-Train
☆20 Updated 3 weeks ago
RUCAIBox / Virgo
Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*
☆105 Updated 2 months ago
si0wang / VisVM
☆45 Updated 7 months ago
TIGER-AI-Lab / MEGA-Bench
This repo contains the code for "MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks" [ICLR 2025]
☆73 Updated last month
kokolerk / TON
Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models
☆40 Updated 3 weeks ago
facebookresearch / multimodal_rewardbench
Multimodal RewardBench
☆42 Updated 5 months ago
sail-sg / AnytimeReasoner
Optimizing Anytime Reasoning via Budget Relative Policy Optimization
☆43 Updated 3 weeks ago