Timothyxxx / WorldModelPapers
Paper collections of the continuous effort start from World Models.
☆142Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for WorldModelPapers
- ☆114Updated 4 months ago
- [ICLR 2024] Code for the paper "Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning"☆129Updated last month
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆108Updated 7 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆99Updated 3 weeks ago
- Paper collections of methods that using language to interact with environment, including interact with real world, simulated world or WWW…☆123Updated last year
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.☆199Updated 3 months ago
- A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)☆138Updated last month
- Towards Large Multimodal Models as Visual Foundation Agents☆123Updated last week
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)☆165Updated this week
- ☆89Updated 3 months ago
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback☆96Updated last year
- Reference implementation for Token-level Direct Preference Optimization(TDPO)☆109Updated 4 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆97Updated 2 months ago
- The source code of the paper "Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Pla…☆77Updated 3 months ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆57Updated 3 months ago
- [NIPS24W]This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated…☆73Updated 4 months ago
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…☆192Updated last week
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆97Updated last month
- LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents (ICLR 2024)☆62Updated 3 months ago
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆208Updated this week
- Official Repo of LangSuitE☆78Updated 3 months ago
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents☆32Updated 6 months ago
- Reasoning with Language Model is Planning with World Model☆146Updated last year
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆75Updated 9 months ago
- Text world based on Minecraft rules.☆11Updated 6 months ago
- Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld☆47Updated last month
- ☆29Updated last week
- ☆104Updated 2 weeks ago
- Code for Contrastive Preference Learning (CPL)☆154Updated this week
- [ICLR 2024] Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models"☆226Updated 3 weeks ago