Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.
☆546Nov 17, 2025Updated 4 months ago
Alternatives and similar repositories for LLM-RL-Papers
Users that are interested in LLM-RL-Papers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A collection of LLM with RL papers☆278Apr 24, 2024Updated last year
- ☆21Apr 12, 2024Updated last year
- A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.☆384Apr 24, 2024Updated last year
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆277Oct 27, 2025Updated 4 months ago
- [ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for Reinforcement Learning☆201Dec 17, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Official implementation of Zero-Hero paper☆29Feb 13, 2025Updated last year
- Implementation of TWOSOME☆82Jan 11, 2025Updated last year
- ☆89Aug 21, 2023Updated 2 years ago
- AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)☆19Aug 9, 2024Updated last year
- Natural Language Reinforcement Learning☆102Jul 30, 2025Updated 7 months ago
- Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with Large Language Model☆54Apr 19, 2024Updated last year
- ☆28Nov 7, 2025Updated 4 months ago
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆246Dec 11, 2025Updated 3 months ago
- LLM-Empowered State Representation for Reinforcement Learning (ICML2024 Accepted paper)☆37Jun 14, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A Survey on Large Language Model-Based Game Agents☆847Feb 13, 2026Updated last month
- This repo supports integrating LLMs and communication algorithms with MARL using SMAC as the platform. It provides an end-to-end workflow…☆17Mar 8, 2025Updated last year
- [IROS2023]Learning to Solve Tasks with Exploring Prior Behaviours☆12Mar 3, 2024Updated 2 years ago
- A curated list of reinforcement learning with human feedback resources (continually updated)☆4,331Dec 9, 2025Updated 3 months ago
- A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites☆4,300Jan 27, 2026Updated last month
- ☆39Aug 10, 2025Updated 7 months ago
- ☆63Jan 30, 2026Updated last month
- This repository is an implementation of "MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer"…☆22Jul 6, 2023Updated 2 years ago
- Implementation of Stein Variational Gradient Descent with TensorFlow 2.0☆12Sep 11, 2019Updated 6 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- TextStarCraft2,a pure language env which support llms play starcraft2☆304Apr 25, 2025Updated 11 months ago
- ☆63Nov 15, 2024Updated last year
- ☆15Mar 26, 2024Updated last year
- A curated list of awesome model based RL resources (continually updated)☆1,319Dec 20, 2025Updated 3 months ago
- Official Repository for 'Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences' (CVPR 2024)☆16Mar 29, 2024Updated last year
- ☆15Jan 18, 2026Updated 2 months ago
- Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhi…☆749Sep 11, 2025Updated 6 months ago
- [ICANN 2022] ''An Improved Lightweight YOLOv5 Model Based on Attention Mechanism for Face Mask Detection'' Official Code☆10Feb 27, 2024Updated 2 years ago
- Large Language Models and Robotics.☆22Apr 27, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL☆24Nov 4, 2024Updated last year
- This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Re…☆658Nov 29, 2024Updated last year
- A curated list of Diffusion Model in RL resources (continually updated)☆1,552Dec 15, 2025Updated 3 months ago
- An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)☆9,231Updated this week
- MENTOR is a highly efficient visual RL algorithm that excels in both simulation and real-world complex robotic learning tasks.☆27Jul 9, 2025Updated 8 months ago
- Train transformer language models with reinforcement learning.☆17,697Mar 18, 2026Updated last week
- A list of Offline to Online RL papers (continually updated)☆75Mar 7, 2026Updated 2 weeks ago