Timothyxxx / WorldModelPapers
Paper collections of the continuous effort start from World Models.
☆169Updated 8 months ago
Alternatives and similar repositories for WorldModelPapers:
Users that are interested in WorldModelPapers are comparing it to the libraries listed below
- ☆125Updated 8 months ago
- [ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"☆153Updated 3 months ago
- A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)☆158Updated 2 months ago
- Embodied Agent Interface (EAI): Benchmarking LLMs for Embodied Decision Making (NeurIPS D&B 2024 Oral)☆179Updated 2 weeks ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆127Updated 4 months ago
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆323Updated 3 months ago
- Paper collections of methods that using language to interact with environment, including interact with real world, simulated world or WWW…☆126Updated last year
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"☆132Updated 2 weeks ago
- ☆82Updated 8 months ago
- A brief and partial summary of RLHF algorithms.☆124Updated 2 weeks ago
- Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld☆54Updated 5 months ago
- [NeurIPSw'24] This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simu…☆83Updated last month
- Code for Contrastive Preference Learning (CPL)☆162Updated 4 months ago
- [ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain☆102Updated last year
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆153Updated 11 months ago
- LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents (ICLR 2024)☆67Updated this week
- ☆54Updated 4 months ago
- ☆44Updated last year
- [ICLR 2024] Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models"☆246Updated last week
- ICLR 2025 Agent-Related Papers☆56Updated 4 months ago
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆145Updated 4 months ago
- [ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models☆170Updated this week
- Natural Language Reinforcement Learning☆80Updated 3 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆119Updated 6 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆130Updated 3 months ago
- Reference implementation for Token-level Direct Preference Optimization(TDPO)☆130Updated last month
- Official Repo of LangSuitE☆83Updated 7 months ago