Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework
☆279Jan 17, 2026Updated 3 months ago
Alternatives and similar repositories for AgentRL
Users that are interested in AgentRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python package to augment multilingual data☆15Feb 15, 2023Updated 3 years ago
- Scalable and extensible reinforcement learning for LM agents.☆114Apr 18, 2026Updated 2 weeks ago
- 🎉 TrustJudge is accepted to ICLR 2026!☆46Sep 27, 2025Updated 7 months ago
- Spectral Sphere Optimizer☆114Mar 23, 2026Updated last month
- Search Self-Play: Pushing the Frontier of Agent Capability without Supervision☆99Mar 4, 2026Updated 2 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Agentic Learning Powered by AWorld☆104Apr 16, 2026Updated 2 weeks ago
- PyTorch implementation for the paper "The Case for Cleaner Biosignals: High-fidelity Neural Compressor Enables Transfer from Cleaner iEEG…☆19Sep 18, 2025Updated 7 months ago
- 2022年春复旦大学大二下组成与体系结构实验☆16Feb 20, 2023Updated 3 years ago
- [NeurIPS 24] Can LLMs Solve Molecule Puzzles? A Multimodal Benchmark for Molecular Structure Elucidation☆18Jan 2, 2026Updated 4 months ago
- NaturalCodeBench (Findings of ACL 2024)☆70Oct 14, 2024Updated last year
- ☆83Dec 23, 2025Updated 4 months ago
- Universal preflight security scanner for AI coding agents — Detects hooks injection, credential exfiltration & backdoors in .cursorrules,…☆69Apr 9, 2026Updated 3 weeks ago
- Introduction and scripts for ACL-2020 paper "On Exposure Bias, Hallucination and Domain Shift in Neural Machine Translation"☆21Jun 23, 2020Updated 5 years ago
- ☆19Mar 10, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ASID-Caption: Attribute-Structured and Quality-Verified Audiovisual Instruction Dataset and Training Pipeline for Fine-Grained Video Unde…☆63Mar 3, 2026Updated 2 months ago
- slime is an LLM post-training framework for RL Scaling.☆5,548Updated this week
- ☆30Oct 8, 2025Updated 6 months ago
- CBU5201 Deception Dataset☆20Dec 10, 2024Updated last year
- [NeurIPS 2025] The implementation of paper "On Reasoning Strength Planning in Large Reasoning Models"☆32Jul 6, 2025Updated 9 months ago
- The official repository of the first version of ACE-Brain foundation model.☆75Mar 13, 2026Updated last month
- Official code for "SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization"☆235Apr 7, 2026Updated 3 weeks ago
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆19Nov 22, 2025Updated 5 months ago
- PeRL: Parameter-Efficient Reinforcement Learning☆75Apr 21, 2026Updated 2 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A curated list of resources on Reinforcement Learning with Verifiable Rewards (RLVR) and the reasoning capability boundary of Large Langu…☆88Dec 12, 2025Updated 4 months ago
- ☆13Oct 13, 2025Updated 6 months ago
- verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…☆1,853Feb 27, 2026Updated 2 months ago
- verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework☆21,046Updated this week
- [ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆383Mar 30, 2026Updated last month
- Internal utility libraries for Pkl☆16Apr 24, 2026Updated last week
- [ACL 2025 Main] SceneGenAgent: Precise Industrial Scene Generation with Coding Agent☆36Nov 29, 2024Updated last year
- [SIGIR 2025] Benchmarking Recommendation, Classification, and Tracing Based on Hugging Face Knowledge Graph☆16Jun 6, 2025Updated 10 months ago
- Long Context Research☆32Jan 26, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆14Feb 22, 2025Updated last year
- This repo is for source code of NeurIPS 2021 paper "Be Confident! Towards Trustworthy Graph Neural Networks via Confidence Calibration".☆22Jan 4, 2022Updated 4 years ago
- a within-document event coreference resolution system, trained and evaluated on the KBP corpus.☆10May 15, 2023Updated 2 years ago
- Distributed IO-aware Attention algorithm☆24Sep 24, 2025Updated 7 months ago
- ☆44Mar 31, 2026Updated last month
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen…☆717Feb 15, 2026Updated 2 months ago
- (best/better) practices of megatron on veRL and tuning guide☆132Updated this week