☆117Jun 11, 2025Updated 8 months ago
Alternatives and similar repositories for Multi-Turn-RL-Agent
Users that are interested in Multi-Turn-RL-Agent are comparing it to the libraries listed below
Sorting:
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆263May 5, 2025Updated 10 months ago
- the datasets of our paper☆11Feb 26, 2024Updated 2 years ago
- ☆67May 23, 2025Updated 9 months ago
- [ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"☆12Dec 6, 2024Updated last year
- ☆18May 3, 2025Updated 10 months ago
- DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery☆20Sep 24, 2025Updated 5 months ago
- SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data☆21Jan 24, 2026Updated last month
- ☆299Aug 12, 2025Updated 6 months ago
- ☆39Jul 25, 2024Updated last year
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆2,527Feb 27, 2026Updated last week
- ☆18Jun 13, 2025Updated 8 months ago
- Entropy Based Sampling and Parallel CoT Decoding☆17Oct 9, 2024Updated last year
- A holistic benchmark for LLM abstention☆73Aug 27, 2025Updated 6 months ago
- ☆119Feb 25, 2026Updated last week
- ☆32Aug 7, 2025Updated 7 months ago
- Code for the paper "ICON: Improving Inter-Report Consistency in Radiology Report Generation via Lesion-aware Mixup Augmentation" (EMNLP'2…☆17Dec 11, 2024Updated last year
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners☆86May 21, 2025Updated 9 months ago
- Code for "DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation" (ACL 2024)☆22May 18, 2024Updated last year
- verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…☆1,611Feb 27, 2026Updated last week
- ☆25May 28, 2025Updated 9 months ago
- PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models☆25Jul 22, 2024Updated last year
- ☆25Jun 10, 2025Updated 8 months ago
- Implementation of the BLUE benchmark with Transformers.☆20Feb 16, 2024Updated 2 years ago
- Our library for RL environments + evals☆3,877Updated this week
- Retrieval Augmented Generation Generalized Evaluation Dataset☆61Jul 16, 2025Updated 7 months ago
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Aug 20, 2025Updated 6 months ago
- Repo for outstanding paper@ACL 2023 "Do PLMs Know and Understand Ontological Knowledge?"☆33Oct 16, 2023Updated 2 years ago
- Official repository of PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning☆57Oct 16, 2025Updated 4 months ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆37Oct 7, 2025Updated 5 months ago
- cold-mailing ai agent that takes in email address, recipient name, and other details to send mail.☆27Jun 5, 2025Updated 9 months ago
- ☆25Apr 9, 2025Updated 11 months ago
- Codes and Pre-trained models for RAMM: Retrieval-augmented Biomedical Visual Question Answering with Multi-modal Pre-training [ACM MM 202…☆29Nov 2, 2023Updated 2 years ago
- [ECML-PKDD2025] Visual Tree Search of Web Agent☆37Jul 18, 2025Updated 7 months ago
- ☆27Dec 12, 2024Updated last year
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆4,135Nov 13, 2025Updated 3 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆40Aug 7, 2025Updated 7 months ago
- EMNLP'22 | PromptEHR: Conditional Electronic Healthcare Records Generation with Prompt Learning☆32Jun 8, 2023Updated 2 years ago
- Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency"☆32Apr 12, 2025Updated 10 months ago
- Repo for Llatrieval☆31Aug 21, 2024Updated last year