Timothyxxx / WorldModelPapersLinks
Paper collections of the continuous effort start from World Models.
☆190Updated last year
Alternatives and similar repositories for WorldModelPapers
Users that are interested in WorldModelPapers are comparing it to the libraries listed below
Sorting:
- ☆133Updated last year
- [ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for Reinforcement Learning☆191Updated 11 months ago
- Code for Contrastive Preference Learning (CPL)☆177Updated last year
- Training VLM agents with multi-turn reinforcement learning☆338Updated last week
- [ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.☆227Updated last month
- ☆118Updated 8 months ago
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆401Updated 11 months ago
- A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)☆192Updated 4 months ago
- Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agen…☆290Updated 2 years ago
- Official Repo of LangSuitE☆84Updated last year
- LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents (ICLR 2024)☆82Updated 6 months ago
- [IROS'25 Oral & NeurIPSw'24] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simula…☆97Updated 5 months ago
- Embodied Agent Interface (EAI): Benchmarking LLMs for Embodied Decision Making (NeurIPS D&B 2024 Oral)☆272Updated 9 months ago
- ☆54Updated last year
- Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"☆43Updated 8 months ago
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…☆290Updated last year
- Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld☆60Updated last year
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆159Updated last year
- Reasoning with Language Model is Planning with World Model☆180Updated 2 years ago
- [NeurIPS D&B Track 2024] Source code for the paper "Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge…☆22Updated 7 months ago
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆198Updated 7 months ago
- [ICLR 2024] Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models"☆286Updated 8 months ago
- The source code of the paper "Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Pla…☆107Updated last year
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆396Updated 5 months ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆41Updated last year
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"☆187Updated 9 months ago
- ☆65Updated 9 months ago
- maze datasets for investigating OOD behavior of ML systems☆66Updated last month
- Official repository for "RLVR-World: Training World Models with Reinforcement Learning" (NeurIPS 2025), https://arxiv.org/abs/2505.13934☆151Updated last month
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆182Updated 6 months ago