A high-performance RL training-inference weight synchronization framework, designed to deliver parameter updates from training to inference within seconds in RL workflows.
☆161May 25, 2026Updated last month
Alternatives and similar repositories for asystem-awex
Users that are interested in asystem-awex are comparing it to the libraries listed below.
- An asynchronous streaming data management module for efficient post-training.☆103Updated this week
- ☆42Dec 9, 2025Updated 6 months ago
- An NCCL extension library, designed to efficiently offload GPU memory allocated by the NCCL communication library.☆111Dec 17, 2025Updated 6 months ago
- Gensis is a lightweight deep learning framework written from scratch in Python, with Triton as its backend for high-performance computing…☆35Jan 15, 2026Updated 5 months ago
- Official Implementation for NorMuon paper☆81Apr 30, 2026Updated 2 months ago
- ☆14Mar 5, 2024Updated 2 years ago
- Code and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆11Dec 30, 2024Updated last year
- ☆40Nov 28, 2024Updated last year
- FlashKDA: high-performance Kimi Delta Attention kernels☆450May 26, 2026Updated last month
- Python library to add support for embedding natural code in Python with shared program state.☆30Jan 20, 2026Updated 5 months ago
- ☆10Aug 8, 2021Updated 4 years ago
- Composable and Embeddable Communication Runtime for Distributed AI Services☆102Jun 5, 2026Updated 3 weeks ago
- A curated list of resources (surveys, papers, benchmarks, and opensource projects) on Rubrics☆94Jun 15, 2026Updated 2 weeks ago
- Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.☆1,631Jun 27, 2026Updated last week
- FlexAttention w/ FlashAttention3 Support☆27Oct 5, 2024Updated last year
- Validate semantic equivalence between C++ and Rust LLVM IR using State-Of-The-Art Verification☆15Dec 11, 2024Updated last year
- Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]☆12Nov 8, 2024Updated last year
- NVIDIA Networking NIC Configuration Operator For Kubernetes☆20Jun 26, 2026Updated last week
- VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation [TMLR26]☆17Jun 1, 2026Updated last month
- Generative Modeling with Bayesian Sample Inference☆24May 17, 2025Updated last year
- CATransformers is a framework for joint neural network and hardware architecture search.☆24Mar 17, 2026Updated 3 months ago
- Memory Topology for GPUs☆19Jun 26, 2026Updated last week
- A lightweight unified metrics library in Rust for various metrics systems.☆20Jun 17, 2026Updated 2 weeks ago
- Pipeline-Parallel Lecture: Simplest Dualpipe Implementation.☆31Sep 17, 2025Updated 9 months ago
- ObjWatch is a Python library for OOP debugging with nested tracing and configurable monitoring of modules, classes, members, methods, fun…☆27Feb 5, 2026Updated 4 months ago
- Low-Latency Live Video Streaming over a Low-Earth-Orbit Satellite Network with DASH☆18Sep 6, 2024Updated last year
- [ICLR 2026] Geometric-Mean Policy Optimization☆104Jan 26, 2026Updated 5 months ago
- Suri: Multi-constraint instruction following for long-form text generation [EMNLP’24]☆27Oct 3, 2025Updated 9 months ago
- An Open Standard for Packaging, Distributing and Running LLMs in Cloud-Native Environments☆211Jun 22, 2026Updated last week
- Noisy language compiler☆17Jul 31, 2024Updated last year
- WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous.☆17Aug 4, 2022Updated 3 years ago
- A storage plugin that provides CRI-O/Podman with the ability to lazily mount nydus images.☆43May 12, 2025Updated last year
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Oct 5, 2024Updated last year
- Performance Prediction for NLP Tasks☆17May 5, 2020Updated 6 years ago
- ☆33Aug 30, 2024Updated last year
- 15-721 Spring 2024 - Cache #1☆12May 2, 2024Updated 2 years ago
- Spatial Transformer Network (STN) provides attention to a particular region in an image by applying a transformation to the input image. T…☆15Dec 21, 2020Updated 5 years ago
- Source code for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆17Jan 8, 2025Updated last year
- ☆14May 18, 2024Updated 2 years ago