WindyLab / LLM-RL-PapersLinks

Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.

☆458

Alternatives and similar repositories for LLM-RL-Papers

Users that are interested in LLM-RL-Papers are comparing it to the libraries listed below

Sorting:

123penny123 / Awesome-LLM-RL
A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.
☆372Updated last year
floodsung / LLM-with-RL-papers
A collection of LLM with RL papers
☆276Updated last year
yingchengyang / Reinforcement-Learning-Papers
Related papers for reinforcement learning, including classic papers and latest papers in top conferences
☆456Updated 4 months ago
yuqingd / ellm
☆81Updated last year
RL4VLM / RL4VLM
Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
☆376Updated 7 months ago
NKAI-Decision-Team / LLM-PySC2
LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…
☆138Updated 3 months ago
PKU-RL / AdaRefiner
AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)
☆16Updated 11 months ago
devindeng94 / LLM-SMAC
A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models
☆46Updated 4 months ago
flowersteam / Grounding_LLMs_with_online_RL
We perform functional grounding of LLMs' knowledge in BabyAI-Text
☆268Updated 11 months ago
1989Ryan / llm-mcts
[NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…
☆281Updated 8 months ago
histmeisah / Large-Language-Models-play-StarCraftII
TextStarCraft2,a pure language env which support llms play starcraft2
☆285Updated 3 months ago
apexrl / Diff4RLSurvey
This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Re…
☆591Updated 8 months ago
qiwang067 / LS-Imagine
[ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"
☆149Updated last month
PKU-Alignment / safety-gymnasium
NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark
☆479Updated 5 months ago
WeihaoTan / TWOSOME
Implementation of TWOSOME
☆77Updated 6 months ago
OpenRL-Lab / openrl
Unified Reinforcement Learning Framework
☆762Updated 11 months ago
dunnolab / awesome-in-context-rl
Awesome In-Context RL: A curated list of In-Context Reinforcement Learning - - —
☆208Updated 2 weeks ago
vwxyzjn / ppo-implementation-details
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
☆811Updated last year
facebookresearch / online-dt
Online Decision Transformer
☆263Updated last year
PKU-MARL / HARL
Official implementation of HARL algorithms based on PyTorch.
☆749Updated 3 months ago
berkeleydeeprlcourse / homework_fall2023
☆241Updated 8 months ago
facebookresearch / BenchMARL
BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). BenchMARL allows to quickly compare different MARL alg…
☆442Updated 2 weeks ago
johnjim0816 / joyrl-offline
☆66Updated last year
CJReinforce / RIME_ICML2024
Official code for ICML 2024 paper, "RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences" (ICML 2024 Spotlight)
☆32Updated 9 months ago
Zhendong-Wang / Diffusion-Policies-for-Offline-RL
☆375Updated last year
Shanghai-Digital-Brain-Laboratory / BDM-DB1
A large-scale multi-modal pre-trained model
☆132Updated 2 years ago
nikhilbarhate99 / min-decision-transformer
Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open…
☆278Updated 3 years ago
jannerm / trajectory-transformer
Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"
☆510Updated 2 years ago
flowersteam / lamorel
Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).
☆236Updated 9 months ago
OpenDFM / Rememberer
[NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents
☆34Updated last year