AQ-MedAI/MrlX

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AQ-MedAI/MrlX)

AQ-MedAI / MrlX

MrlX: A Multi-Agent Reinforcement Learning Framework

☆214

Alternatives and similar repositories for MrlX

Users that are interested in MrlX are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AQ-MedAI / QReward
View on GitHub
[QReward] RewardService Python Client, make RL Training reward function more faster
☆16Apr 23, 2026Updated 3 months ago
AQ-MedAI / MedicalAiBenchEval
View on GitHub
A comprehensive medical AI evaluation framework based on GAPS methodology. Features automated assessment pipeline, thoracic surgery datas…
☆44Nov 3, 2025Updated 8 months ago
radixark / miles
View on GitHub
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
☆1,790Updated this week
Zhiyuan-Zeng / RLVE
View on GitHub
[ICML 2026] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
☆226Apr 30, 2026Updated 2 months ago
GMISWE / tinker-cloud
View on GitHub
Tinkering RL
☆26Jul 15, 2026Updated last week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
RLsys-Foundation / APRIL
View on GitHub
APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation. A system-level optimization for scalable LLM tra…
☆60Oct 11, 2025Updated 9 months ago
AQ-MedAI / MedResearcher-R1
View on GitHub
MedResearcher-R1 is a deep research agent for medical scenarios, built on a knowledge-informed trajectory synthesis framework.
☆515Sep 1, 2025Updated 10 months ago
langfengQ / DrMAS
View on GitHub
Dr. MAS is an end-to-end RL training framework for multi-agent LLM systems, supporting the co-training of multiple (heterogeneous) LLMs.
☆144Apr 1, 2026Updated 3 months ago
inclusionAI / AWorld-RL
View on GitHub
Agentic Learning Powered by AWorld
☆117Jun 18, 2026Updated last month
ISEEKYAN / mbridge
View on GitHub
Bridge Megatron-Core to Hugging Face/Reinforcement Learning
☆228Jun 15, 2026Updated last month
THUDM / slime
View on GitHub
slime is an LLM post-training framework for RL Scaling.
☆7,645Updated this week
ventr1c / RES-GCL
View on GitHub
An official PyTorch implementation of "Certifiably Robust Graph Contrastive Learning" (NeurIPS 2023)
☆11Jan 22, 2024Updated 2 years ago
TransferQueue / TransferQueue
View on GitHub
[Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co…
☆16Jan 16, 2026Updated 6 months ago
areal-project / AReaL
View on GitHub
The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.
☆5,604Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Chen-GX / SEER
View on GitHub
☆15Feb 10, 2025Updated last year
TIGER-AI-Lab / verl-tool
View on GitHub
A version of verl to support diverse tool use [TMLR 2026]
☆1,024Jul 15, 2026Updated last week
stepfun-ai / StepFun-Formalizer
View on GitHub
StepFun-Formalizer: Unlocking the Autoformalization Potential of LLMs through Knowledge-Reasoning Fusion
☆29Aug 19, 2025Updated 11 months ago
NovaSky-AI / SkyRL
View on GitHub
SkyRL: A Modular Full-stack RL Library for LLMs
☆2,095Updated this week
redai-infra / Relax
View on GitHub
An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale
☆542Updated this week
axon-rl / gem
View on GitHub
A Gym for Agentic LLMs
☆502Jan 21, 2026Updated 6 months ago
fzyzcjy / torch_memory_saver
View on GitHub
Allow torch tensor memory to be released and resumed later
☆261Updated this week
alibaba / ROLL
View on GitHub
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
☆3,327Updated this week
tilde-research / nsa-release
View on GitHub
An efficient implementation of the NSA (Native Sparse Attention) kernel
☆133Jun 24, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
TsinghuaC3I / MARTI
View on GitHub
A Framework for LLM-based Multi-Agent Reinforced Training and Inference
☆540Apr 14, 2026Updated 3 months ago
MiroMindAI / MiroRL
View on GitHub
MiroRL is an MCP-first reinforcement learning framework for deep research agent.
☆246Aug 27, 2025Updated 10 months ago
Chen-GX / ReForm
View on GitHub
☆21Jan 31, 2026Updated 5 months ago
RLsys-Foundation / TritonForge
View on GitHub
🔥 LLM-powered GPU kernel synthesis: Train models to convert PyTorch ops into optimized Triton kernels via SFT+RL. Multi-turn compilation…
☆146Nov 10, 2025Updated 8 months ago
THUDM / DeepDive
View on GitHub
DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL
☆333Jun 17, 2026Updated last month
mzf666 / MATPO
View on GitHub
Official implementation of MATPO: Multi-Agent Tool-Integrated Policy Optimization.
☆82Oct 31, 2025Updated 8 months ago
rllm-org / rllm
View on GitHub
Democratizing Reinforcement Learning for LLMs
☆5,732Updated this week
mit-han-lab / flash-moba
View on GitHub
☆251Nov 19, 2025Updated 8 months ago
warlockee / oxRL
View on GitHub
A lightweight post-training framework for LLMs and VLMs. 51 algorithms, 38 verified models. Scales with DeepSpeed, vLLM, and Ray.
☆19Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ByteDance-Seed / Seed-1.8
View on GitHub
☆219Dec 19, 2025Updated 7 months ago
verl-project / uni-agent
View on GitHub
Uni-Agent is a framework for training long-horizon agents.
☆437Updated this week
yaof20 / Flash-RL
View on GitHub
Implementation for FP8/INT8 Rollout for RL training without performence drop.
☆307Nov 7, 2025Updated 8 months ago
vllm-project / vime
View on GitHub
An LLM post-training framework with vLLM for RL Scaling
☆387Updated this week
MoonshotAI / checkpoint-engine
View on GitHub
Checkpoint-engine is a simple middleware to update model weights in LLM inference engines
☆982Jul 4, 2026Updated 3 weeks ago
Dao-AILab / sonic-moe
View on GitHub
Accelerating MoE with IO and Tile-aware Optimizations
☆732Jul 4, 2026Updated 3 weeks ago
hkust-nlp / Toolathlon
View on GitHub
[ICLR 2026] The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution
☆440Updated this week