nex-agi/NexRL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/nex-agi/NexRL)

nex-agi / NexRL

NexRL is an ultra-loosely-coupled LLM post-training framework.

☆114

Alternatives and similar repositories for NexRL

Users that are interested in NexRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

nex-agi / NexGAP
View on GitHub
Nex General Agentic Data Pipeline, an end-to-end pipeline for generating high-quality agentic training data.
☆36Nov 19, 2025Updated 8 months ago
nex-agi / NexHTML
View on GitHub
HTML Agent based on NexAU
☆16Nov 20, 2025Updated 8 months ago
nex-agi / NexDR
View on GitHub
NexDR (Nex Deep Research), a leading deep research agent that autonomously investigates complex topics and generates rich, structured rep…
☆36Dec 4, 2025Updated 7 months ago
nex-agi / NexVenusCL
View on GitHub
Nex Venus Communication Library
☆75Nov 17, 2025Updated 8 months ago
nex-agi / Nex-N1
View on GitHub
☆116Dec 5, 2025Updated 7 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
nex-agi / NexA4A
View on GitHub
Nex Agent for Agent is a meta-agent system that automatically creates specialized AI agents based on natural language requirements.
☆29Nov 18, 2025Updated 8 months ago
nex-agi / weaver
View on GitHub
Python SDK for Weaver.
☆17Updated this week
nex-agi / NexAU
View on GitHub
NexAU (AU for Agent Universe), a general-purpose agent framework for building intelligent agents with tool capabilities.
☆190Jul 7, 2026Updated 2 weeks ago
TransferQueue / TransferQueue
View on GitHub
[Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co…
☆16Jan 16, 2026Updated 6 months ago
WooooDyy / BAPO
View on GitHub
Codes for the paper "BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping…
☆94Jan 29, 2026Updated 5 months ago
OpenLMLab / ParallelTokenizer
View on GitHub
Use the tokenizer in parallel to achieve superior acceleration
☆20Mar 21, 2024Updated 2 years ago
romitjain / kachua-mlsys
View on GitHub
[MLSys 26] 🥇 Solution for Gated Delta Net Track of MLSys 26 Flash infer competition
☆35May 22, 2026Updated 2 months ago
zhuzilin / flash-attention-with-sink
View on GitHub
☆37Aug 7, 2025Updated 11 months ago
chenyu-jiang / dcp
View on GitHub
Code repository for the SOSP'25 paper DCP: Addressing Input Dynamism In Long-Context Training via Dynamic Context Parallelism.
☆21Nov 28, 2025Updated 7 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
inclusionAI / asystem-amem
View on GitHub
A NCCL extension library, designed to efficiently offload GPU memory allocated by the NCCL communication library.
☆113Dec 17, 2025Updated 7 months ago
Terra-Flux / PolyRL
View on GitHub
[NSDI'26] PolyRL is a reinforcement learning framework for LLM that harvest spot instances on the cloud to reduce cost.
☆19Mar 30, 2026Updated 3 months ago
mit-han-lab / fastrl
View on GitHub
[ASPLOS'26] Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter
☆174Feb 27, 2026Updated 4 months ago
LinB203 / FSDP-Training
View on GitHub
Minimal PyTorch implementation of TP, SP, FSDP and sharded-EMA
☆32Nov 27, 2025Updated 7 months ago
MiroMindAI / MiroRL
View on GitHub
MiroRL is an MCP-first reinforcement learning framework for deep research agent.
☆246Aug 27, 2025Updated 10 months ago
ISEEKYAN / mbridge
View on GitHub
Bridge Megatron-Core to Hugging Face/Reinforcement Learning
☆226Jun 15, 2026Updated last month
yzlnew / infra-skills
View on GitHub
A collection of specialized agent skills for AI infrastructure development, enabling Claude Code to write, optimize, and debug high-perfo…
☆140Jul 9, 2026Updated 2 weeks ago
ISEEKYAN / verl_megatron_practice
View on GitHub
(best/better) practices of megatron on veRL and tuning guide
☆137May 12, 2026Updated 2 months ago
sail-sg / odc
View on GitHub
On demand communication
☆34Apr 16, 2026Updated 3 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mlc-ai / pith-train
View on GitHub
Compact and Agent-Native MoE Training System
☆299Updated this week
radixark / miles
View on GitHub
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
☆1,789Updated this week
yaof20 / Flash-RL
View on GitHub
Implementation for FP8/INT8 Rollout for RL training without performence drop.
☆306Nov 7, 2025Updated 8 months ago
inclusionAI / AState
View on GitHub
☆42Updated this week
FlexFusion / FlexFusion
View on GitHub
The official implementation for the intra-stage fusion technique introduced in https://arxiv.org/abs/2409.13221
☆31Apr 22, 2025Updated last year
oliverYoung2001 / UltraAttn
View on GitHub
SC'25 UltraAttn: Efficiently Parallelizing Attention through Hierarchical Context-Tiling
☆16Aug 14, 2025Updated 11 months ago
KuangjuX / AttnLink
View on GitHub
An experimental communicating attention kernel based on DeepEP.
☆34Jul 29, 2025Updated 11 months ago
radixark / miles_diffusion
View on GitHub
[Experimental] Miles-diffusion is an post-training framework for large-scale diffusion model training and production workloads, forked fr…
☆23Updated this week
nex-agi / Nex-N2
View on GitHub
☆347Jun 9, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
OpenMOSS / ABC-Bench
View on GitHub
ABC-Bench is a benchmark for Agentic Backend Coding. It evaluates whether code agents can explore real repositories, edit code, configure…
☆33Jan 20, 2026Updated 6 months ago
xinhaoc / ferret
View on GitHub
Autonomous CUDA kernel optimization agent with structured task specs and per-config scoring
☆17Jun 17, 2026Updated last month
open-tinker / OpenTinker
View on GitHub
OpenTinker is an RL-as-a-Service infrastructure for foundation models
☆676Mar 21, 2026Updated 4 months ago
InternLM / InternLM-WQX
View on GitHub
☆19Jul 5, 2024Updated 2 years ago
redai-infra / Relax
View on GitHub
An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale
☆538Updated this week
MoonshotAI / checkpoint-engine
View on GitHub
Checkpoint-engine is a simple middleware to update model weights in LLM inference engines
☆982Jul 4, 2026Updated 3 weeks ago
liangyuwang / Tiny-DeepSpeed
View on GitHub
Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library
☆53Aug 20, 2025Updated 11 months ago