pettingllms-ai/PettingLLMs

[ICLR'26] Stronger-MAS: A RL Framework for multi LLM agent system; [arxiv] MetaAgent-X: End-to-End Reinforcement Learning Automatic Multi-Agent Systems

☆206

Alternatives and similar repositories for PettingLLMs

Users who are interested in PettingLLMs compare it to the libraries listed below.

langfengQ / DrMAS
Dr. MAS is an end-to-end RL training framework for multi-agent LLM systems, supporting the co-training of multiple (heterogeneous) LLMs.
☆145 · Updated this week
TsinghuaC3I / MARTI
A Framework for LLM-based Multi-Agent Reinforced Training and Inference
☆540 · Apr 14, 2026 · Updated 3 months ago
mzf666 / MATPO
Official implementation of MATPO: Multi-Agent Tool-Integrated Policy Optimization.
☆82 · Oct 31, 2025 · Updated 8 months ago
jwliao-ai / MARFT
☆86 · May 14, 2026 · Updated 2 months ago
chanwoo-park-official / MAPoRL
☆54 · Sep 6, 2025 · Updated 10 months ago
langfengQ / verl-agent
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…
☆2,158 · Jun 9, 2026 · Updated last month
WangHanLinHenry / STeCa
(ACL2025 Findings) Official code for the paper "STeCa: Step-level Trajectory Calibration for LLM Agent Learning"
☆29 · Mar 2, 2026 · Updated 4 months ago
Terra-Flux / PolyRL
[NSDI'26] PolyRL is a reinforcement learning framework for LLMs that harvests spot instances on the cloud to reduce cost.
☆19 · Mar 30, 2026 · Updated 3 months ago
AMA-Bench / AMA-Bench
[ICML 26] An evaluation framework assessing long-context retention and long-horizon memory performance for agentic applications (AMA-benc…
☆64 · Jun 15, 2026 · Updated last month
chenyiqun / UnityMAS-O
☆57 · Jul 13, 2026 · Updated 2 weeks ago
WangHanLinHenry / SPA-RL-Agent
Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"
☆89 · Sep 13, 2025 · Updated 10 months ago
xxyQwQ / CoMAS
Implementation for the paper "CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards".
☆53 · Jan 26, 2026 · Updated 6 months ago
LiangThree / MCMA
☆16 · Jan 12, 2026 · Updated 6 months ago
BRZ911 / ViTCoT
[ACM MM 2025] ViTCoT: Video-Text Interleaved Chain-of-Thought for Boosting Video Understanding in Large Language Models
☆18 · Jul 15, 2025 · Updated last year
hanningzhang / prm
☆17 · Nov 3, 2024 · Updated last year
rllm-org / rllm
Democratizing Reinforcement Learning for LLMs
☆5,740 · Updated this week
Gen-Verse / CURE
[NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning
☆167 · Sep 19, 2025 · Updated 10 months ago
RUC-NLPIR / ARPO
[ICLR 2026] Agentic Reinforced Policy Optimization (ARPO)
☆1,093 · Jul 13, 2026 · Updated 2 weeks ago
spiral-rl / spiral
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
☆199 · Mar 27, 2026 · Updated 4 months ago
mll-lab-nu / RAGEN
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
☆2,757 · Updated this week
CharlesQ9 / Self-Evolving-Agents
☆1,262 · Oct 15, 2025 · Updated 9 months ago
shiqichen17 / SPA
GitHub repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL"
☆36 · Nov 1, 2025 · Updated 8 months ago
ysy-phoenix / evalhub
All-in-one benchmarking platform for evaluating LLMs.
☆15 · Nov 12, 2025 · Updated 8 months ago
AQ-MedAI / MrlX
MrlX: A Multi-Agent Reinforcement Learning Framework
☆215 · Jan 19, 2026 · Updated 6 months ago
Qwen-Applications / SSP
Search Self-Play: Pushing the Frontier of Agent Capability without Supervision
☆20 · Dec 30, 2025 · Updated 6 months ago
HHHHHejia / Awesome-AgenticLLM-RL-Papers
☆1,849 · Jun 18, 2026 · Updated last month
SII-MARFT / MARFT
☆20 · May 14, 2026 · Updated 2 months ago
mll-lab-nu / VAGEN
World model reasoning RL for multi-turn VLM agents
☆488 · Updated this week
ErikZ719 / CoTA
[ICLR 26] Context Tokens are Anchors: Understanding the Repeat Curse in dMLLMs from an Information Flow Perspective
☆16 · Mar 6, 2026 · Updated 4 months ago
zhaoxlpku / PromptCoT
☆17 · Apr 10, 2025 · Updated last year
XinshuangL / SELF-PARAM
The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"
☆15 · May 18, 2025 · Updated last year
THUDM / slime
slime is an LLM post-training framework for RL Scaling.
☆7,679 · Updated this week
TIGER-AI-Lab / verl-tool
A version of verl to support diverse tool use [TMLR 2026]
☆1,026 · Jul 15, 2026 · Updated 2 weeks ago
1229095296 / ResRL
This repository includes code for our paper: ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning…
☆15 · May 2, 2026 · Updated 2 months ago
Infini-AI-Lab / GRESO
☆82 · Jun 8, 2026 · Updated last month
EIT-NLP / BLEUless_DocMT
☆14 · Nov 19, 2024 · Updated last year
WooooDyy / AgentGym-RL
Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen…
☆822 · Feb 15, 2026 · Updated 5 months ago
ganler / code-r1
Reproducing R1 for Code with Reliable Rewards
☆313 · May 5, 2025 · Updated last year
ulab-uiuc / Multi-agent-evolve
☆153 · Jan 21, 2026 · Updated 6 months ago