abdulhaim/LMRL-Gym

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/abdulhaim/LMRL-Gym)

abdulhaim / LMRL-Gym

☆116

Alternatives and similar repositories for LMRL-Gym

Users that are interested in LMRL-Gym are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

YifeiZhou02 / ArCHer
View on GitHub
Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"
☆208Apr 17, 2025Updated last year
facebookresearch / sweet_rl
View on GitHub
Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks
☆271May 5, 2025Updated last year
WooooDyy / AgentGym
View on GitHub
Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhi…
☆817May 30, 2026Updated last month
amazon-science / PAE
View on GitHub
☆70Mar 6, 2025Updated last year
yuqingd / ellm
View on GitHub
☆91Aug 21, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
csmile-1006 / ARP
View on GitHub
Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)
☆33Sep 25, 2023Updated 2 years ago
WentseChen / Verlog
View on GitHub
Verlog: A Multi-turn RL framework for LLM agents
☆73Apr 28, 2026Updated 2 months ago
dpaiton / DeepSparseCoding
View on GitHub
Hierarchical Models for Learning Features from Images and Videos
☆12Feb 7, 2023Updated 3 years ago
scaleapi / SWE-Interact
View on GitHub
New testbed of interactive SWE tasks for coding agents, set in a realistic multi-turn developer driven environment
☆24Jun 30, 2026Updated 3 weeks ago
likenneth / dialogue_action_token
View on GitHub
Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner
☆31Jun 27, 2024Updated 2 years ago
mll-lab-nu / RAGEN
View on GitHub
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
☆2,756Updated this week
microsoft / LLF-Bench
View on GitHub
A benchmark for evaluating learning agents based on just language feedback
☆98Mar 26, 2026Updated 4 months ago
ServiceNow / BrowserGym
View on GitHub
🌎💪 BrowserGym, a Gym environment for web task automation
☆1,289Jul 17, 2026Updated last week
Yifan-Song793 / ETO
View on GitHub
Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)
☆168Oct 30, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
princeton-nlp / WebShop
View on GitHub
[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
☆572Sep 6, 2024Updated last year
irhum / esmjax
View on GitHub
ESM2 protein language models in JAX/Flax
☆19Oct 10, 2022Updated 3 years ago
TextArena / TextArena
View on GitHub
A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning
☆411Updated this week
DigiRL-agent / digirl
View on GitHub
Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.
☆393Feb 22, 2025Updated last year
allenai / reward-bench
View on GitHub
RewardBench: the first evaluation tool for reward models.
☆727Feb 16, 2026Updated 5 months ago
OSU-NLP-Group / TravelPlanner
View on GitHub
[ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"
☆531May 24, 2026Updated 2 months ago
liyheng / FOP
View on GitHub
☆14Jul 12, 2021Updated 5 years ago
RL4VLM / RL4VLM
View on GitHub
Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
☆415Dec 15, 2024Updated last year
google-research / reincarnating_rl
View on GitHub
[NeurIPS 2022] Open source code for reusing prior computational work in RL.
☆100Jul 5, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Trae1ounG / Pretrain_Space_RLVR
View on GitHub
[arxiv: 2604.14142] From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space
☆17Apr 16, 2026Updated 3 months ago
ZhaolinGao / REFUEL
View on GitHub
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
☆25Oct 8, 2024Updated last year
langfengQ / verl-agent
View on GitHub
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…
☆2,153Jun 9, 2026Updated last month
microsoft / tale-suite
View on GitHub
Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments.
☆30Jul 17, 2026Updated last week
mickelliu / selfplay-redteaming
View on GitHub
☆37Oct 21, 2025Updated 9 months ago
allenai / ScienceWorld
View on GitHub
ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.
☆370Dec 3, 2025Updated 7 months ago
sunblaze-ucb / omega
View on GitHub
☆47Jun 24, 2025Updated last year
spiral-rl / spiral
View on GitHub
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
☆199Mar 27, 2026Updated 3 months ago
benellis3 / pymarl2
View on GitHub
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
☆19Aug 20, 2023Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
pearls-lab / meow-tea-taro
View on GitHub
A Practitioner's Guide to M(eow)ti Turn Agentic ReinfOrcement learning
☆83Jan 16, 2026Updated 6 months ago
scandukuri / assistant-gate
View on GitHub
☆28May 29, 2024Updated 2 years ago
wbbeyourself / DTE
View on GitHub
Detect-Then-Explain Framework for Text-to-SQL task
☆10Dec 6, 2023Updated 2 years ago
flowersteam / Grounding_LLMs_with_online_RL
View on GitHub
We perform functional grounding of LLMs' knowledge in BabyAI-Text
☆276Oct 27, 2025Updated 8 months ago
ServiceNow / PipelineRL
View on GitHub
A scalable asynchronous reinforcement learning implementation with in-flight weight updates.
☆430Updated this week
Timothyxxx / NeuralSymbolicPapers
View on GitHub
☆14Aug 18, 2022Updated 3 years ago
Linear95 / SPAG
View on GitHub
Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024
☆145Feb 24, 2025Updated last year