AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reasoning models.
☆130Mar 18, 2025Updated 11 months ago
Alternatives and similar repositories for AutoCoA
Users that are interested in AutoCoA are comparing it to the libraries listed below
Sorting:
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆156Dec 24, 2024Updated last year
- Official code for the paper: DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models☆22Jan 6, 2026Updated last month
- The official repo for our paper: LegalAgentBench: Evaluating LLM Agents in Legal Domainl☆43Dec 30, 2024Updated last year
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei…☆1,328May 16, 2025Updated 9 months ago
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆1,246Feb 12, 2026Updated 3 weeks ago
- The code of “Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning”☆17Feb 26, 2024Updated 2 years ago
- ☆21Dec 24, 2024Updated last year
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆689Aug 5, 2025Updated 7 months ago
- ☆335May 24, 2025Updated 9 months ago
- [ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents☆48Feb 2, 2026Updated last month
- ☆444Oct 16, 2025Updated 4 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆2,522Updated this week
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆159Oct 30, 2024Updated last year
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆4,085Nov 13, 2025Updated 3 months ago
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!☆55Mar 21, 2025Updated 11 months ago
- ReCAP: Recursive Context-Aware Reasoning and Planning for Large Language Model Agents, NeurIPS 2025☆33Nov 15, 2025Updated 3 months ago
- [ACL'24] WebCiteS: Attributed Query-Focused Summarization on Chinese Web Search Results with Citations☆13Sep 11, 2024Updated last year
- Code for Expert Supervised Reinforcement Learning☆10Apr 7, 2021Updated 4 years ago
- An MCP server providing intelligent transcript processing capabilities, featuring natural formatting, contextual repair, and smart summar…☆18Mar 14, 2025Updated 11 months ago
- The open sourced code from the Decent AI mobile app, built with Expo☆14Apr 3, 2025Updated 11 months ago
- ☆215Feb 20, 2025Updated last year
- [AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆94Nov 8, 2025Updated 3 months ago
- A live stream development of RL tunning for LLM agents☆3,927Oct 8, 2025Updated 4 months ago
- Chapter 13 Learning to Run in book Deep Reinforcement Learning: code example of solving NIPS 2017: Learning to Run challenge with paralle…☆13Jul 4, 2021Updated 4 years ago
- Synthetic Data Generation using LLM via Argilla, Distilabel, ChatGPT, etc.☆30May 29, 2024Updated last year
- story based implementation for sequential thinking☆15Dec 15, 2025Updated 2 months ago
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆116Dec 30, 2025Updated 2 months ago
- ☆497Oct 11, 2025Updated 4 months ago
- A collection of recent open-source math datasets for training and evaluating Math LLMs☆23Dec 8, 2025Updated 2 months ago
- A Python implementation of the Sequential Thinking MCP server using the official Model Context Protocol (MCP) Python SDK. This server fac…☆24Jun 1, 2025Updated 9 months ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆16Jun 28, 2024Updated last year
- Unofficial pixabay python API client☆13Feb 6, 2023Updated 3 years ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆705Oct 15, 2025Updated 4 months ago
- [ICLR 2026] Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn Search Agents☆34Feb 1, 2026Updated last month
- Repo. for RLCF.☆15Apr 1, 2024Updated last year
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling☆22Dec 16, 2024Updated last year
- ☆17Feb 4, 2025Updated last year
- A general framework used on evaluating the performance of large language models (LLMs) based on the peer review mechanism among LLMs☆19Aug 3, 2024Updated last year
- ☆18Apr 18, 2025Updated 10 months ago