A NCCL extension library, designed to efficiently offload GPU memory allocated by the NCCL communication library.
☆109Dec 17, 2025Updated 5 months ago
Alternatives and similar repositories for asystem-amem
Users that are interested in asystem-amem are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆42Dec 9, 2025Updated 6 months ago
- An experimental communicating attention kernel based on DeepEP.☆34Jul 29, 2025Updated 10 months ago
- Tutorials for NVIDIA CUPTI samples☆68Nov 3, 2025Updated 7 months ago
- ☆20Nov 18, 2023Updated 2 years ago
- An ultra-fast, distributed Safetensors loader☆60May 27, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆14Mar 15, 2026Updated 2 months ago
- ☆31Apr 8, 2026Updated 2 months ago
- ☆28Jun 2, 2026Updated last week
- Large language models to diffusion finetuning code☆26Jun 2, 2025Updated last year
- ☆66Apr 26, 2025Updated last year
- ☆168Dec 27, 2024Updated last year
- ☆57Feb 24, 2026Updated 3 months ago
- Composable and Embeddable Communication Runtime for Distributed AI Services☆101Jun 5, 2026Updated last week
- ☆19Nov 11, 2025Updated 7 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Pytorch routines for (Ker)nel (Mac)hines☆12Oct 10, 2025Updated 8 months ago
- ☆13Jan 7, 2025Updated last year
- NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer☆192Feb 11, 2026Updated 4 months ago
- DeeperGEMM: crazy optimized version☆86May 5, 2025Updated last year
- A high-performance RL training-inference weight synchronization framework, designed to enable second-level parameter updates from trainin…☆160May 25, 2026Updated 2 weeks ago
- NVIDIA Inference Xfer Library (NIXL)☆1,079Updated this week
- A collection of workload implementations for the LDBC SNB benchmark driver☆20Jun 7, 2021Updated 5 years ago
- GPUDirect Async support for IB Verbs☆137Nov 10, 2022Updated 3 years ago
- a simple API to use CUPTI☆10Aug 19, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A lightweight design for computation-communication overlap.☆234Jan 20, 2026Updated 4 months ago
- ☆453Aug 10, 2025Updated 10 months ago
- ☆135Updated this week
- Artifact from "Hardware Compute Partitioning on NVIDIA GPUs". THIS IS A FORK OF BAKITAS REPO. I AM NOT ONE OF THE AUTHORS OF THE PAPER.☆64Nov 24, 2025Updated 6 months ago
- A fast communication-overlapping library for tensor/expert parallelism on GPUs.☆1,323Aug 28, 2025Updated 9 months ago
- Ring attention implementation with flash attention☆1,025Sep 10, 2025Updated 9 months ago
- train a model on huchenfeng dataset☆52Dec 8, 2025Updated 6 months ago
- ☆27Aug 31, 2023Updated 2 years ago
- A Top-Down Profiler for GPU Applications☆22Feb 29, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A simple demo for using Sentinel with Spring Cloud Alibaba☆17Nov 8, 2018Updated 7 years ago
- DeepXTrace is a lightweight tool for precisely diagnosing slow ranks in DeepEP-based environments.☆99Jan 16, 2026Updated 4 months ago
- CUDA 12.2 HMM demos☆21Jul 26, 2024Updated last year
- Fastest kernels written from scratch☆583Sep 18, 2025Updated 8 months ago
- [NeurIPS 2025] ClusterFusion: Expanding Operator Fusion Scope for LLM Inference via Cluster-Level Collective Primitive☆73Dec 11, 2025Updated 6 months ago
- Sequence-level 1F1B schedule for LLMs.☆37Aug 26, 2025Updated 9 months ago
- Important experiments on memory management, file access, network transfer, job scheduler, and so on.☆15Apr 27, 2022Updated 4 years ago