floodsung / LLM-with-RL-papersLinks

A collection of LLM with RL papers

☆276

Alternatives and similar repositories for LLM-with-RL-papers

Users that are interested in LLM-with-RL-papers are comparing it to the libraries listed below

Sorting:

123penny123 / Awesome-LLM-RL
A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.
☆372Updated last year
WindyLab / LLM-RL-Papers
Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.
☆458Updated 10 months ago
OpenDFM / Rememberer
[NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents
☆34Updated last year
flowersteam / Grounding_LLMs_with_online_RL
We perform functional grounding of LLMs' knowledge in BabyAI-Text
☆268Updated 11 months ago
1989Ryan / llm-mcts
[NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…
☆281Updated 8 months ago
Shanghai-Digital-Brain-Laboratory / BDM-DB1
A large-scale multi-modal pre-trained model
☆132Updated 2 years ago
BAAI-Agents / GPA-LM
This repo is a live list of papers on game playing and large multimodality model - "A Survey on Game Playing Agents and Large Models: Met…
☆150Updated 11 months ago
CraftJarvis / MC-Planner
Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agen…
☆282Updated 2 years ago
histmeisah / Large-Language-Models-play-StarCraftII
TextStarCraft2,a pure language env which support llms play starcraft2
☆285Updated 3 months ago
alfworld / alfworld
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
☆499Updated 2 weeks ago
flowersteam / lamorel
Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).
☆236Updated 9 months ago
YifeiZhou02 / ArCHer
Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"
☆185Updated 3 months ago
PKU-RL / Plan4MC
[NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks
☆189Updated last year
WeihaoTan / TWOSOME
Implementation of TWOSOME
☆77Updated 6 months ago
haotiansun14 / AdaPlanner
AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback
☆111Updated 4 months ago
yuqingd / ellm
☆81Updated last year
microsoft / SmartPlay
SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …
☆140Updated last year
liziniu / ReMax
Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)
☆189Updated last year
jhejna / cpl
Code for Contrastive Preference Learning (CPL)
☆173Updated 8 months ago
mengdi-li / awesome-RLAIF
A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)
☆177Updated last week
louieworth / awesome-rlhf
An index of algorithms for reinforcement learning from human feedback (rlhf))
☆92Updated last year
UMass-Embodied-AGI / CoELA
[ICLR 2024] Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models"
☆265Updated 4 months ago
PKU-Alignment / ProAgent
AAAI24(Oral) ProAgent: Building Proactive Cooperative Agents with Large Language Models
☆89Updated 5 months ago
AGI-Edgerunners / LLM-Planning-Papers
Must-read Papers on Large Language Model (LLM) Planning.
☆423Updated last year
dunnolab / awesome-in-context-rl
Awesome In-Context RL: A curated list of In-Context Reinforcement Learning - - —
☆208Updated 2 weeks ago
karthikv792 / LLMs-Planning
An extensible benchmark for evaluating large language models on planning
☆393Updated last month
RL4VLM / RL4VLM
Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
☆376Updated 7 months ago
xlang-ai / text2reward
[ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"
☆172Updated 7 months ago
HosnLS / Hierarchical-Language-Agent
☆33Updated last year
csmile-1006 / PreferenceTransformer
Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)
☆163Updated last year