Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework
☆249Jan 17, 2026Updated 2 months ago
Alternatives and similar repositories for AgentRL
Users that are interested in AgentRL are comparing it to the libraries listed below
Sorting:
- ☆62Updated this week
- All-in-One Safety Evaluation Framwork☆46Mar 4, 2026Updated 2 weeks ago
- Search Self-Play: Pushing the Frontier of Agent Capability without Supervision☆97Mar 4, 2026Updated 2 weeks ago
- Agentic Learning Powered by AWorld☆92Updated this week
- NaturalCodeBench (Findings of ACL 2024)☆68Oct 14, 2024Updated last year
- Evaluation utilities based on SymPy.☆22Dec 12, 2024Updated last year
- ☆68Dec 23, 2025Updated 2 months ago
- The official repository of the first version of ACE-Brain foundation model.☆62Mar 13, 2026Updated last week
- Introduction and scripts for ACL-2020 paper "On Exposure Bias, Hallucination and Domain Shift in Neural Machine Translation"☆21Jun 23, 2020Updated 5 years ago
- ☆19Mar 10, 2025Updated last year
- Codes for Difflare: Removing Image Flare with Latent Diffusion Models☆11Dec 24, 2024Updated last year
- ☆29Oct 8, 2025Updated 5 months ago
- slime is an LLM post-training framework for RL Scaling.☆4,799Updated this week
- ☆140Jan 26, 2026Updated last month
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆19Nov 22, 2025Updated 3 months ago
- llms related stuff , including code, docs☆13Feb 25, 2025Updated last year
- Scalable and extensible reinforcement learning for LM agents.☆111Mar 12, 2026Updated last week
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flex…☆23Mar 2, 2026Updated 2 weeks ago
- A curated list of resources on Reinforcement Learning with Verifiable Rewards (RLVR) and the reasoning capability boundary of Large Langu…☆87Dec 12, 2025Updated 3 months ago
- ☆13Oct 13, 2025Updated 5 months ago
- Long Context Research☆29Jan 26, 2026Updated last month
- [ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆362Jan 12, 2026Updated 2 months ago
- Internal utility libraries for Pkl☆16Mar 10, 2026Updated last week
- [ACL 2025 Main] SceneGenAgent: Precise Industrial Scene Generation with Coding Agent☆35Nov 29, 2024Updated last year
- ☆30Jan 15, 2026Updated 2 months ago
- ☆30Jun 19, 2023Updated 2 years ago
- [SIGIR 2025] Benchmarking Recommendation, Classification, and Tracing Based on Hugging Face Knowledge Graph☆16Jun 6, 2025Updated 9 months ago
- [ICLR 2024] Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain☆10Nov 24, 2025Updated 3 months ago
- ☆14Feb 22, 2025Updated last year
- This repo is for source code of NeurIPS 2021 paper "Be Confident! Towards Trustworthy Graph Neural Networks via Confidence Calibration".☆22Jan 4, 2022Updated 4 years ago
- a within-document event coreference resolution system, trained and evaluated on the KBP corpus.☆10May 15, 2023Updated 2 years ago
- Experiments with AllenNLP on semantic parsing datasets☆17Dec 29, 2018Updated 7 years ago
- ☆43Updated this week
- PyTorch implementation of FAIR's paper "End-to-End Memory Network", NIPS 2015☆12Oct 19, 2017Updated 8 years ago
- This is the official code for the paper "Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable".☆29Mar 11, 2025Updated last year
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆37Oct 3, 2025Updated 5 months ago
- ☆27Mar 6, 2023Updated 3 years ago
- ArcherCodeR is an open-source initiative enhancing code reasoning in large language models through scalable, rule-governed reinforcement …☆45Aug 6, 2025Updated 7 months ago
- ICS_2020_PJ☆11Dec 25, 2020Updated 5 years ago