thuml / RLVR-WorldLinks
Official repository for "RLVR-World: Training World Models with Reinforcement Learning", https://arxiv.org/abs/2505.13934
☆45Updated 2 weeks ago
Alternatives and similar repositories for RLVR-World
Users that are interested in RLVR-World are comparing it to the libraries listed below
Sorting:
- G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning☆64Updated last month
- [ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.☆135Updated 2 weeks ago
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay☆78Updated 3 weeks ago
- Official implementation of paper "ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context Prompting" (CVPR 2025)☆41Updated 2 months ago
- ☆42Updated last month
- AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time☆63Updated 2 weeks ago
- Code release for "Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning" (NeurIPS 2023), https://ar…☆64Updated 8 months ago
- ☆106Updated 2 months ago
- ☆44Updated 5 months ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆36Updated 7 months ago
- [NeurIPS-2024] The offical Implementation of "Instruction-Guided Visual Masking"☆35Updated 7 months ago
- [ArXiv] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding☆50Updated 6 months ago
- ☆38Updated this week
- Natural Language Reinforcement Learning☆89Updated 6 months ago
- ☆61Updated 3 months ago
- ☆49Updated last year
- Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"☆36Updated 3 months ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆104Updated 3 weeks ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆38Updated 3 weeks ago
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …☆64Updated 3 weeks ago
- [ICML 2024] The offical Implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"☆81Updated 3 weeks ago
- Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld☆57Updated 8 months ago
- ☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models☆18Updated 3 weeks ago
- ☆71Updated 6 months ago
- ☆35Updated 2 weeks ago
- Official Repository of LatentSeek☆48Updated 2 weeks ago
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆51Updated 6 months ago
- [IROS'25 Oral & NeurIPSw'24] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simula…☆91Updated last week
- Multimodal RewardBench☆41Updated 4 months ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning"☆118Updated 2 weeks ago