ADaM-BJTU/AutoCoA

ADaM-BJTU / AutoCoA

AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reasoning models.

☆132

Alternatives and similar repositories for AutoCoA

Users who are interested in AutoCoA are comparing it to the repositories listed below.

ADaM-BJTU / W2SG
The code of “Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning”
☆17 · Feb 26, 2024 · Updated 2 years ago
ADaM-BJTU / OpenRFT
OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning
☆157 · Dec 24, 2024 · Updated last year
ADaM-BJTU / Mind_with_eyes_Awesome_MLLMs_Reasoning
This repository continuously collects the latest papers, technical reports, and benchmarks on multimodal reasoning.
☆56 · Mar 21, 2025 · Updated last year
ADaM-BJTU / MemAct
☆30 · Nov 29, 2025 · Updated 7 months ago
AgentR1 / Agent-R1
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
☆1,570 · Updated this week
elena-luo / SODE
☆52 · Apr 8, 2025 · Updated last year
Agent-RL / ReCall
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei…
☆1,412 · May 16, 2025 · Updated last year
RUCAIBox / R1-Searcher
R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
☆720 · Aug 5, 2025 · Updated 11 months ago
MozerWang / AMPO
[ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents
☆51 · Feb 2, 2026 · Updated 5 months ago
Tongyi-CCAI / Complex-IF
☆34 · Jan 26, 2026 · Updated 5 months ago
mll-lab-nu / RAGEN
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
☆2,756 · Apr 14, 2026 · Updated 3 months ago
ADaM-BJTU / O1-CODER
AN O1 REPLICATION FOR CODING
☆332 · Dec 11, 2024 · Updated last year
Yifan-Song793 / ETO
Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)
☆168 · Oct 30, 2024 · Updated last year
Freder-chen / ReasonGenRM
A simple implementation of ReasonGenRM.
☆19 · Apr 21, 2025 · Updated last year
GAIR-NLP / ToRL
☆352 · May 24, 2025 · Updated last year
xiwenc1 / DRA-GRPO
Official code for the paper: DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models
☆24 · Jan 6, 2026 · Updated 6 months ago
tidbcloud / tiinsight
☆22 · Dec 24, 2024 · Updated last year
CSHaitao / LegalAgentBench
The official repo for our paper: LegalAgentBench: Evaluating LLM Agents in Legal Domain
☆49 · Apr 10, 2026 · Updated 3 months ago
GAIR-NLP / LIMR
☆221 · Feb 20, 2025 · Updated last year
amao0o0 / awesome-AI-Math-Datasets
A collection of recent open-source math datasets for training and evaluating Math LLMs
☆32 · Apr 26, 2026 · Updated 2 months ago
0russwest0 / Awesome-Agent-RL
☆511 · Oct 11, 2025 · Updated 9 months ago
PeterGriffinJin / Search-R1
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
☆5,150 · Nov 13, 2025 · Updated 8 months ago
chuzhumin98 / PRE
A general framework for evaluating the performance of large language models (LLMs) based on a peer-review mechanism among LLMs
☆19 · Aug 3, 2024 · Updated last year
cjj826 / GoalAct
The repo for our paper: Enhancing LLM-Based Agents via Global Planning and Hierarchical Execution (NCIIP 2025 Best Paper)
☆17 · Aug 18, 2025 · Updated 11 months ago
RUC-NLPIR / HiRA
The code for the paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search [SIGIR 2026]
☆65 · Jul 4, 2025 · Updated last year
qiancheng0 / ToolRL
☆514 · Oct 16, 2025 · Updated 9 months ago
MozerWang / DEMO
[ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling
☆22 · Dec 16, 2024 · Updated last year
T-Lab-CUHKSZ / G2RPO-A
[ACL 2026] G2RPO-A: Guided Group Relative Policy Optimization with Adaptive Guidance
☆16 · May 20, 2026 · Updated 2 months ago
asonabend / ESRL
Code for Expert Supervised Reinforcement Learning
☆10 · Apr 7, 2021 · Updated 5 years ago
facebookresearch / sweet_rl
Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks
☆271 · May 5, 2025 · Updated last year
gouki510 / Topology_of_Reasoning
☆42 · Jun 11, 2025 · Updated last year
wenjunli-0 / deepresearch-survey
a survey on deep research
☆48 · Sep 9, 2025 · Updated 10 months ago
OpenManus / OpenManus-RL
A live stream development of RL tuning for LLM agents
☆4,140 · May 5, 2026 · Updated 2 months ago
CSQianDong / RLCF
Repo. for RLCF.
☆15 · Apr 1, 2024 · Updated 2 years ago
RyanLiu112 / GenPRM
[AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".
☆102 · Nov 8, 2025 · Updated 8 months ago
wenzhe-li / Self-MoA
☆17 · Feb 4, 2025 · Updated last year
chicosirius / think-or-not
☆22 · May 23, 2025 · Updated last year
GaryStack / Trustworthy-Evaluation
Repository of paper "Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis" (ACL 2025 Main)
☆19 · Jul 19, 2025 · Updated last year
DaoD / SPRING
[AAAI'25] SPRING: Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models
☆26 · Sep 24, 2025 · Updated 10 months ago