floodsung / LLM-with-RL-papers
A collection of LLM with RL papers
☆248 Updated 9 months ago
Alternatives and similar repositories for LLM-with-RL-papers:
Users interested in LLM-with-RL-papers are comparing it to the repositories listed below.
- A comprehensive list of PAPERS, CODEBASES, and DATASETS on Decision Making using Foundation Models, including LLMs and VLMs. ☆347 Updated 9 months ago
- Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome. ☆272 Updated 4 months ago
- We perform functional grounding of LLMs' knowledge in BabyAI-Text ☆233 Updated 5 months ago
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents ☆33 Updated 8 months ago
- This repo is a live list of papers on game playing and large multimodality model - "A Survey on Game Playing Agents and Large Models: Met… ☆124 Updated 4 months ago
- A large-scale multi-modal pre-trained model ☆130 Updated last year
- ☆69 Updated last year
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett… ☆248 Updated 2 months ago
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs). ☆209 Updated 2 months ago
- Reinforcement learning and planning for Minecraft. ☆168 Updated 10 months ago
- Implementation of TWOSOME ☆62 Updated 2 weeks ago
- TextStarCraft2, a pure-language environment that lets LLMs play StarCraft II ☆241 Updated last month
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning ☆252 Updated last month
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted) ☆157 Updated last year
- Code for Contrastive Preference Learning (CPL) ☆159 Updated 2 months ago
- Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agen… ☆267 Updated last year
- ALFWorld: Aligning Text and Embodied Environments for Interactive Learning ☆399 Updated 3 weeks ago
- [ICLR 2024] Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models" ☆242 Updated 3 months ago
- Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models) ☆167 Updated last year
- [ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning" ☆141 Updated last month
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL" ☆125 Updated 10 months ago
- ☆106 Updated last year
- RLHF implementation details of OAI's 2019 codebase ☆171 Updated last year
- NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark ☆416 Updated 8 months ago
- ☆123 Updated 6 months ago
- An RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM ☆66 Updated 5 months ago
- The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization ☆678 Updated 10 months ago
- (ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training ☆257 Updated 8 months ago
- CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game. ☆100 Updated 4 months ago
- Online Decision Transformer ☆246 Updated last year