NexRL is an ultra-loosely-coupled LLM post-training framework.
☆104Mar 23, 2026Updated 3 weeks ago
Alternatives and similar repositories for NexRL
Users that are interested in NexRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- NexAU (AU for Agent Universe), a general-purpose agent framework for building intelligent agents with tool capabilities.☆55Apr 7, 2026Updated last week
- Nex General Agentic Data Pipeline, an end-to-end pipeline for generating high-quality agentic training data.☆33Nov 19, 2025Updated 4 months ago
- Nex Venus Communication Library☆73Nov 17, 2025Updated 4 months ago
- ☆107Dec 5, 2025Updated 4 months ago
- ☆27Mar 10, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The official implementation for the intra-stage fusion technique introduced in https://arxiv.org/abs/2409.13221☆31Apr 22, 2025Updated 11 months ago
- a survey of long-context LLMs from four perspectives, architecture, infrastructure, training, and evaluation☆61Mar 31, 2025Updated last year
- Tiny-Megatron, a minimalistic re-implementation of the Megatron library☆23Sep 1, 2025Updated 7 months ago
- ☆35Jan 30, 2026Updated 2 months ago
- Learning MLPs to replace GNN☆10Jun 3, 2023Updated 2 years ago
- Source code for the paper 'Uncovering Neural Scaling Laws in Molecular Representation Learning' (NeurIPS 2023 Datasets and Benchmarks).☆14Dec 2, 2023Updated 2 years ago
- Information Extraction related tools and models☆10Mar 16, 2023Updated 3 years ago
- Tiny-FSDP, a minimalistic re-implementation of the PyTorch FSDP☆103Aug 20, 2025Updated 7 months ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 6 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ICCV 2025] Task-Specific Zero-shot Quantization-Aware Training for Object Detection☆25Sep 26, 2025Updated 6 months ago
- Does Ayano Takeda really understand Hibike! Euphonium?☆11Apr 3, 2021Updated 5 years ago
- 🚀 轻量视频🎥 大模型🤖☆22Apr 27, 2025Updated 11 months ago
- AMT-CDR: A Deep Adversarial Multi-channel Transfer Network for Cross-domain Recommendation☆11Nov 2, 2023Updated 2 years ago
- MiroRL is an MCP-first reinforcement learning framework for deep research agent.☆244Aug 27, 2025Updated 7 months ago
- Softened ROSA QKV Operators for Training Next-Generation LLM Models☆36Apr 7, 2026Updated last week
- ☆27Aug 31, 2023Updated 2 years ago
- a simple API to use CUPTI☆10Aug 19, 2025Updated 7 months ago
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling☆22Apr 9, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Generic Neural Architecture Search via Regression (NeurIPS'21 Spotlight)☆36Aug 29, 2022Updated 3 years ago
- ☆11Oct 11, 2023Updated 2 years ago
- Improved the performance of 8-bit PTQ4DM expecially on FID.☆11Aug 30, 2023Updated 2 years ago
- [Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co…☆15Jan 16, 2026Updated 3 months ago
- LLM benchmarks☆13Feb 22, 2024Updated 2 years ago
- ☆12Aug 18, 2023Updated 2 years ago
- [ICML 2023] Taxonomy-Structured Domain Adaptation☆12Oct 6, 2023Updated 2 years ago
- This repository consists of useful tools or guides for system software development or anything interesting.☆11Feb 27, 2026Updated last month
- ☆10Aug 21, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Large Context Attention☆770Oct 13, 2025Updated 6 months ago
- Implementation for FP8/INT8 Rollout for RL training without performence drop.☆298Nov 7, 2025Updated 5 months ago
- libsmctrl论文的复现,添加了python端接口,可以在python端灵活调用接口来分配计算资源☆12May 21, 2024Updated last year
- ResearcherBench: Evaluating Deep AI Research Systems on the Frontiers of Scientific Inquiry☆48Jan 5, 2026Updated 3 months ago
- Verlog: A Multi-turn RL framework for LLM agents☆72Mar 27, 2026Updated 2 weeks ago
- Implementation from scratch in C of the Multi-head latent attention used in the Deepseek-v3 technical paper.☆18Jan 15, 2025Updated last year
- papers of distilling Graph Neural Network☆24Dec 11, 2021Updated 4 years ago