Timothyxxx / WorldModelPapersLinks
Paper collections of the continuous effort start from World Models.
☆173Updated 11 months ago
Alternatives and similar repositories for WorldModelPapers
Users that are interested in WorldModelPapers are comparing it to the libraries listed below
Sorting:
- ☆130Updated 11 months ago
- [ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"☆166Updated 6 months ago
- A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)☆171Updated 5 months ago
- Official Repo of LangSuitE☆84Updated 10 months ago
- ☆106Updated 2 months ago
- Paper collections of methods that using language to interact with environment, including interact with real world, simulated world or WWW…☆127Updated last year
- ☆54Updated 7 months ago
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"☆160Updated 3 months ago
- Code for Contrastive Preference Learning (CPL)☆171Updated 7 months ago
- [ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.☆135Updated 2 weeks ago
- ☆61Updated 3 months ago
- ☆169Updated this week
- ☆95Updated 11 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆144Updated 7 months ago
- Embodied Agent Interface (EAI): Benchmarking LLMs for Embodied Decision Making (NeurIPS D&B 2024 Oral)☆209Updated 3 months ago
- Official repository for "RLVR-World: Training World Models with Reinforcement Learning", https://arxiv.org/abs/2505.13934☆45Updated 2 weeks ago
- ☆292Updated last week
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆161Updated last month
- OpenReivew Submission Visualization (ICLR 2024/2025)☆151Updated 8 months ago
- LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents (ICLR 2024)☆75Updated 2 weeks ago
- Natural Language Reinforcement Learning☆89Updated 6 months ago
- [IROS'25 Oral & NeurIPSw'24] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simula…☆91Updated last week
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆121Updated 9 months ago
- A comprehensive collection of process reward models.☆92Updated 2 weeks ago
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆179Updated 2 months ago
- ☆38Updated this week
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback☆108Updated 2 months ago
- This is the repository for paper EscapeBench: Pushing Language Models to Think Outside the Box☆14Updated 6 months ago
- ☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models☆18Updated 3 weeks ago
- ☆139Updated last month