SiliangZeng / Multi-Turn-RL-AgentView external linksLinks
☆113Jun 11, 2025Updated 8 months ago
Alternatives and similar repositories for Multi-Turn-RL-Agent
Users that are interested in Multi-Turn-RL-Agent are comparing it to the libraries listed below
Sorting:
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆262May 5, 2025Updated 9 months ago
- ☆13Aug 4, 2025Updated 6 months ago
- the datasets of our paper☆11Feb 26, 2024Updated last year
- ☆17May 3, 2025Updated 9 months ago
- DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery☆20Sep 24, 2025Updated 4 months ago
- SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data☆21Jan 24, 2026Updated 3 weeks ago
- ☆39Jul 25, 2024Updated last year
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆2,512Jan 25, 2026Updated 3 weeks ago
- ☆18Jun 13, 2025Updated 8 months ago
- A holistic benchmark for LLM abstention☆69Aug 27, 2025Updated 5 months ago
- ☆118Feb 4, 2026Updated last week
- Code for the paper "RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection" (ACL'25).☆33Jul 23, 2025Updated 6 months ago
- Code for the paper "ICON: Improving Inter-Report Consistency in Radiology Report Generation via Lesion-aware Mixup Augmentation" (EMNLP'2…☆17Dec 11, 2024Updated last year
- ☆271Jan 29, 2026Updated 2 weeks ago
- Code for "DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation" (ACL 2024)☆21May 18, 2024Updated last year
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei…☆1,322May 16, 2025Updated 8 months ago
- verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…☆1,533Updated this week
- PyTorch adaptation of Ravens - Transporter Networks☆22Dec 14, 2022Updated 3 years ago
- ☆25May 28, 2025Updated 8 months ago
- PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models☆24Jul 22, 2024Updated last year
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆95Oct 30, 2024Updated last year
- Remote Color Depth Camera without any 3rd-party dependencies in iOS.☆19May 6, 2022Updated 3 years ago
- ☆25Jun 10, 2025Updated 8 months ago
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆27Mar 30, 2023Updated 2 years ago
- Our library for RL environments + evals☆3,833Updated this week
- A Gym for Agentic LLMs☆446Jan 21, 2026Updated 3 weeks ago
- Retrieval Augmented Generation Generalized Evaluation Dataset☆61Jul 16, 2025Updated 6 months ago
- [ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees☆24Jun 19, 2023Updated 2 years ago
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"☆32Sep 13, 2024Updated last year
- Repo for outstanding paper@ACL 2023 "Do PLMs Know and Understand Ontological Knowledge?"☆33Oct 16, 2023Updated 2 years ago
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Aug 20, 2025Updated 5 months ago
- ☆141Updated this week
- Codes and Pre-trained models for RAMM: Retrieval-augmented Biomedical Visual Question Answering with Multi-modal Pre-training [ACM MM 202…☆29Nov 2, 2023Updated 2 years ago
- cold-mailing ai agent that takes in email address, recipient name, and other details to send mail.☆27Jun 5, 2025Updated 8 months ago
- Official repository of PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning☆57Oct 16, 2025Updated 4 months ago
- Code for the paper "RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning" (EMNLP'23 Findings).☆28Jun 12, 2025Updated 8 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆4,021Nov 13, 2025Updated 3 months ago
- The repository for ACL 2024 paper "TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models"☆34Jun 29, 2024Updated last year
- ☆29Aug 3, 2021Updated 4 years ago