NexRL is an ultra-loosely-coupled LLM post-training framework.
☆104Apr 27, 2026Updated last week
Alternatives and similar repositories for NexRL
Users that are interested in NexRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- NexAU (AU for Agent Universe), a general-purpose agent framework for building intelligent agents with tool capabilities.☆69Updated this week
- Nex General Agentic Data Pipeline, an end-to-end pipeline for generating high-quality agentic training data.☆33Nov 19, 2025Updated 5 months ago
- ☆28Mar 10, 2026Updated last month
- Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library☆52Aug 20, 2025Updated 8 months ago
- The official implementation for the intra-stage fusion technique introduced in https://arxiv.org/abs/2409.13221☆31Apr 22, 2025Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- a survey of long-context LLMs from four perspectives, architecture, infrastructure, training, and evaluation☆61Mar 31, 2025Updated last year
- Codes for the paper "BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping…☆92Jan 29, 2026Updated 3 months ago
- ☆41Jan 30, 2026Updated 3 months ago
- Learning MLPs to replace GNN☆10Jun 3, 2023Updated 2 years ago
- Standardized environment infrastructure for Agentic AI development.☆297Apr 29, 2026Updated last week
- Tiny-FSDP, a minimalistic re-implementation of the PyTorch FSDP☆107Aug 20, 2025Updated 8 months ago
- Official repository Flash Local Linear Attention☆23Apr 23, 2026Updated 2 weeks ago
- [ICCV 2025] Task-Specific Zero-shot Quantization-Aware Training for Object Detection☆25Sep 26, 2025Updated 7 months ago
- 🚀 轻量视频🎥 大模型🤖☆22Apr 27, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- MiroRL is an MCP-first reinforcement learning framework for deep research agent.☆246Aug 27, 2025Updated 8 months ago
- [NeurIPS 2024] Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic Segmentation☆36May 22, 2025Updated 11 months ago
- Softened ROSA QKV Operators for Training Next-Generation LLM Models☆36Apr 7, 2026Updated last month
- ☆27Aug 31, 2023Updated 2 years ago
- code for Scaling Laws of RoPE-based Extrapolation☆73Oct 16, 2023Updated 2 years ago
- ☆105Updated this week
- a simple API to use CUPTI☆10Aug 19, 2025Updated 8 months ago
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling☆22Updated this week
- ☆150Apr 8, 2026Updated 3 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆11Oct 11, 2023Updated 2 years ago
- [Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co…☆15Jan 16, 2026Updated 3 months ago
- LLM benchmarks☆13Feb 22, 2024Updated 2 years ago
- ☆12Aug 18, 2023Updated 2 years ago
- ☆13May 23, 2025Updated 11 months ago
- [NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews)☆51Apr 14, 2025Updated last year
- This repository consists of useful tools or guides for system software development or anything interesting.☆11Feb 27, 2026Updated 2 months ago
- ☆10Aug 21, 2023Updated 2 years ago
- Large Context Attention☆769Oct 13, 2025Updated 6 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Implementation for FP8/INT8 Rollout for RL training without performence drop.☆297Nov 7, 2025Updated 6 months ago
- libsmctrl论文的复现,添加了python端接口,可以在python端灵活调用接口来分配计算资源☆12May 21, 2024Updated last year
- ResearcherBench: Evaluating Deep AI Research Systems on the Frontiers of Scientific Inquiry☆49Jan 5, 2026Updated 4 months ago
- Fuzzing for SpinalHDL☆17Oct 10, 2022Updated 3 years ago
- This is the official repo for the paper "Accelerating Parallel Sampling of Diffusion Models" Tang et al. ICML 2024 https://openreview.net…☆16Jul 19, 2024Updated last year
- Verlog: A Multi-turn RL framework for LLM agents☆74Apr 28, 2026Updated last week
- papers of distilling Graph Neural Network☆24Dec 11, 2021Updated 4 years ago