SiliangZeng/Multi-Turn-RL-Agent

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SiliangZeng/Multi-Turn-RL-Agent)

SiliangZeng / Multi-Turn-RL-Agent

☆117

Alternatives and similar repositories for Multi-Turn-RL-Agent

Users that are interested in Multi-Turn-RL-Agent are comparing it to the libraries listed below

Sorting:

facebookresearch / sweet_rl
View on GitHub
Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks
☆263May 5, 2025Updated 10 months ago
bammt / Learn-to-check
View on GitHub
the datasets of our paper
☆11Feb 26, 2024Updated 2 years ago
brendanhogan / picoDeepResearch
View on GitHub
☆67May 23, 2025Updated 9 months ago
EsYoon7 / RLHF-TLCR
View on GitHub
[ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"
☆12Dec 6, 2024Updated last year
AngelaZZZ-611 / reasoning_models_probing
View on GitHub
☆18May 3, 2025Updated 10 months ago
GAIR-NLP / DatasetResearch
View on GitHub
DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery
☆20Sep 24, 2025Updated 5 months ago
wantbook-book / SeRL
View on GitHub
SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data
☆21Jan 24, 2026Updated last month
ReTool-RL / ReTool
View on GitHub
☆299Aug 12, 2025Updated 6 months ago
microsoft / Efficient-Large-LM-Trainer
View on GitHub
☆39Jul 25, 2024Updated last year
mll-lab-nu / RAGEN
View on GitHub
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
☆2,527Feb 27, 2026Updated last week
generalbionix / gb_examples
View on GitHub
☆18Jun 13, 2025Updated 8 months ago
SinatrasC / entropix
View on GitHub
Entropy Based Sampling and Parallel CoT Decoding
☆17Oct 9, 2024Updated last year
facebookresearch / AbstentionBench
View on GitHub
A holistic benchmark for LLM abstention
☆73Aug 27, 2025Updated 6 months ago
TextArena / UnstableBaselines
View on GitHub
☆119Feb 25, 2026Updated last week
microsoft / CollabLLM
View on GitHub
☆32Aug 7, 2025Updated 7 months ago
wjhou / ICon
View on GitHub
Code for the paper "ICON: Improving Inter-Report Consistency in Radiology Report Generation via Lesion-aware Mixup Augmentation" (EMNLP'2…
☆17Dec 11, 2024Updated last year
hkust-nlp / B-STaR
View on GitHub
B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
☆86May 21, 2025Updated 9 months ago
yiqingxyq / DocLens
View on GitHub
Code for "DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation" (ACL 2024)
☆22May 18, 2024Updated last year
langfengQ / verl-agent
View on GitHub
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…
☆1,611Feb 27, 2026Updated last week
benediktstroebl / agent-evals
View on GitHub
☆25May 28, 2025Updated 9 months ago
swarnaHub / System-1.x
View on GitHub
PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models
☆25Jul 22, 2024Updated last year
launchnlp / LitCab
View on GitHub
☆25Jun 10, 2025Updated 8 months ago
sy-wada / blue_benchmark_with_transformers
View on GitHub
Implementation of the BLUE benchmark with Transformers.
☆20Feb 16, 2024Updated 2 years ago
PrimeIntellect-ai / verifiers
View on GitHub
Our library for RL environments + evals
☆3,877Updated this week
neulab / ragged
View on GitHub
Retrieval Augmented Generation Generalized Evaluation Dataset
☆61Jul 16, 2025Updated 7 months ago
tsinghua-fib-lab / SmartAgent
View on GitHub
The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".
☆27Aug 20, 2025Updated 6 months ago
vickywu1022 / OntoProbe-PLMs
View on GitHub
Repo for outstanding paper@ACL 2023 "Do PLMs Know and Understand Ontological Knowledge?"
☆33Oct 16, 2023Updated 2 years ago
KlingAIResearch / PhysMaster
View on GitHub
Official repository of PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning
☆57Oct 16, 2025Updated 4 months ago
sunblaze-ucb / AgentSynth
View on GitHub
AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents
☆37Oct 7, 2025Updated 5 months ago
bythyag / coldmail-generator
View on GitHub
cold-mailing ai agent that takes in email address, recipient name, and other details to send mail.
☆27Jun 5, 2025Updated 9 months ago
VLM-RL / Ocean-R1
View on GitHub
☆25Apr 9, 2025Updated 11 months ago
GanjinZero / RAMM
View on GitHub
Codes and Pre-trained models for RAMM: Retrieval-augmented Biomedical Visual Question Answering with Multi-modal Pre-training [ACM MM 202…
☆29Nov 2, 2023Updated 2 years ago
PathOnAIOrg / VisualTreeSearch-Demo
View on GitHub
[ECML-PKDD2025] Visual Tree Search of Web Agent
☆37Jul 18, 2025Updated 7 months ago
mingzhu0527 / MASHQA
View on GitHub
☆27Dec 12, 2024Updated last year
PeterGriffinJin / Search-R1
View on GitHub
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
☆4,135Nov 13, 2025Updated 3 months ago
McGill-NLP / agent-reward-bench
View on GitHub
AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories
☆40Aug 7, 2025Updated 7 months ago
RyanWangZf / PromptEHR
View on GitHub
EMNLP'22 | PromptEHR: Conditional Electronic Healthcare Records Generation with Prompt Learning
☆32Jun 8, 2023Updated 2 years ago
YiCheng98 / IntegrativeDecoding
View on GitHub
Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency"
☆32Apr 12, 2025Updated 10 months ago
BeastyZ / LLM-Verified-Retrieval
View on GitHub
Repo for Llatrieval
☆31Aug 21, 2024Updated last year