WindyLab / LLM-RL-PapersLinks
Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.
☆533Updated last month
Alternatives and similar repositories for LLM-RL-Papers
Users that are interested in LLM-RL-Papers are comparing it to the libraries listed below
Sorting:
- A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.☆384Updated last year
- A collection of LLM with RL papers☆278Updated last year
- ☆88Updated 2 years ago
- Related papers for reinforcement learning, including classic papers and latest papers in top conferences☆510Updated last month
- Awesome In-Context RL: A curated list of In-Context Reinforcement Learning - - —☆261Updated 3 months ago
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆404Updated last year
- A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models☆51Updated 8 months ago
- Implementation of TWOSOME☆82Updated 11 months ago
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆276Updated 2 months ago
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…☆291Updated last year
- TextStarCraft2,a pure language env which support llms play starcraft2☆293Updated 8 months ago
- AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)☆18Updated last year
- This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Re…☆639Updated last year
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…☆145Updated 8 months ago
- Code release for "Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search" published at NeurIPS '24.☆18Updated 10 months ago
- Online Decision Transformer☆274Updated last year
- The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization☆905Updated last year
- 📖 Full Stack Practice of the Large Language Model Training @ RLChina 2024☆40Updated last year
- ☆300Updated last year
- ☆17Updated last year
- NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark☆528Updated 3 weeks ago
- A large-scale multi-modal pre-trained model☆132Updated 2 years ago
- ALFWorld: Aligning Text and Embodied Environments for Interactive Learning☆607Updated 5 months ago
- Official implementation of HARL algorithms based on PyTorch.☆835Updated 8 months ago
- This repo is a live list of papers on game playing and large multimodality model - "A Survey on Game Playing Agents and Large Models: Met…☆160Updated last year
- A Survey on Large Language Model-Based Game Agents☆788Updated last month
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆243Updated 2 weeks ago
- Unified Reinforcement Learning Framework☆802Updated last year
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆166Updated 2 years ago
- ☆413Updated last year