Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework
☆221Jan 17, 2026Updated last month
Alternatives and similar repositories for AgentRL
Users that are interested in AgentRL are comparing it to the libraries listed below
Sorting:
- Long Context Research☆26Jan 26, 2026Updated last month
- Agentic Learning Powered by AWorld☆90Feb 13, 2026Updated 2 weeks ago
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆19Nov 22, 2025Updated 3 months ago
- NaturalCodeBench (Findings of ACL 2024)☆68Oct 14, 2024Updated last year
- ☆56Updated this week
- Experiments with AllenNLP on semantic parsing datasets☆17Dec 29, 2018Updated 7 years ago
- CBU5201 Deception Dataset☆20Dec 10, 2024Updated last year
- Introduction and scripts for ACL-2020 paper "On Exposure Bias, Hallucination and Domain Shift in Neural Machine Translation"☆21Jun 23, 2020Updated 5 years ago
- ☆33Jul 15, 2025Updated 7 months ago
- Spectral Sphere Optimizer☆99Jan 14, 2026Updated last month
- ☆52Oct 10, 2024Updated last year
- [EMNLP 2025] Official codebase for Rearank: Reasoning Re-ranking Agent☆33Aug 20, 2025Updated 6 months ago
- [ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆358Jan 12, 2026Updated last month
- This repo is for source code of NeurIPS 2021 paper "Be Confident! Towards Trustworthy Graph Neural Networks via Confidence Calibration".☆22Jan 4, 2022Updated 4 years ago
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking☆113Jun 13, 2025Updated 8 months ago
- ☆30Jun 19, 2023Updated 2 years ago
- Distributed IO-aware Attention algorithm☆24Sep 24, 2025Updated 5 months ago
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆68Dec 8, 2025Updated 2 months ago
- ☆34Dec 18, 2025Updated 2 months ago
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments☆48Jan 8, 2026Updated last month
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning☆46Jul 17, 2025Updated 7 months ago
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models"☆66Jan 13, 2026Updated last month
- Codes for paper SoAy: A Service-oriented APIs Applying Framework of Large Language Models☆27Jul 14, 2025Updated 7 months ago
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆30Mar 5, 2024Updated last year
- A unified tokenization tool for Images, Chinese and English.☆153Mar 23, 2023Updated 2 years ago
- ☆30May 22, 2024Updated last year
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆28May 28, 2024Updated last year
- ☆136Jan 26, 2026Updated last month
- ☆37Aug 28, 2025Updated 6 months ago
- This is the official repo for Towards Uncertainty-Aware Language Agent.☆31Aug 15, 2024Updated last year
- slime is an LLM post-training framework for RL Scaling.☆4,381Updated this week
- ☆27Mar 6, 2023Updated 2 years ago
- ☆28Nov 10, 2025Updated 3 months ago
- Codebase for Instruction Following without Instruction Tuning☆36Sep 24, 2024Updated last year
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆30Dec 12, 2024Updated last year
- An open-source session replay tool for single-page applications that uses AI analysis, aggregated trends, and a RAG chatbot to help devel…☆11Jan 23, 2026Updated last month
- [ICLR 2025] Official PyTorch Implementation for CPE: Concept Pinpoint Eraser for Text-to-image Diffusion Models via Residual Attention Ga…☆12Apr 7, 2025Updated 10 months ago
- MiroRL is an MCP-first reinforcement learning framework for deep research agent.☆233Aug 27, 2025Updated 6 months ago
- ☆76Nov 22, 2024Updated last year