WindyLab / LLM-RL-PapersLinks
Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.
☆521Updated 2 weeks ago
Alternatives and similar repositories for LLM-RL-Papers
Users that are interested in LLM-RL-Papers are comparing it to the libraries listed below
Sorting:
- A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.☆383Updated last year
- A collection of LLM with RL papers☆278Updated last year
- Related papers for reinforcement learning, including classic papers and latest papers in top conferences☆504Updated 3 weeks ago
- ☆88Updated 2 years ago
- A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models☆49Updated 8 months ago
- Awesome In-Context RL: A curated list of In-Context Reinforcement Learning - - —☆259Updated 2 months ago
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…☆142Updated 7 months ago
- Implementation of TWOSOME☆82Updated 10 months ago
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆400Updated 11 months ago
- AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)☆18Updated last year
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…☆290Updated last year
- Online Decision Transformer☆273Updated last year
- This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Re…☆634Updated last year
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆276Updated last month
- TextStarCraft2,a pure language env which support llms play starcraft2☆293Updated 7 months ago
- ☆293Updated last year
- 📖 Full Stack Practice of the Large Language Model Training @ RLChina 2024☆40Updated last year
- Official implementation of HARL algorithms based on PyTorch.☆824Updated 7 months ago
- The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization☆888Updated last year
- BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). BenchMARL allows to quickly compare different MARL alg…☆521Updated 3 weeks ago
- Must-read Papers on Large Language Model (LLM) Planning.☆433Updated last year
- NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark☆519Updated this week
- Unified Reinforcement Learning Framework☆797Updated last year
- ☆13Updated last year
- ☆407Updated last year
- Code release for "Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search" published at NeurIPS '24.☆17Updated 9 months ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆165Updated 2 years ago
- [ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for Reinforcement Learning☆190Updated 11 months ago
- ☆456Updated last year
- This repo is a live list of papers on game playing and large multimodality model - "A Survey on Game Playing Agents and Large Models: Met…☆160Updated last year