Open source version of DOCA GPUNetIO and DOCA Verbs libraries (limited features) to enable GDAKI technology on RDMA (IB and RoCE)
☆31 · Feb 27, 2026 · Updated this week
Alternatives and similar repositories for gpunetio
Users that are interested in gpunetio are comparing it to the libraries listed below
- Official implementation of Acc-SpMM: Accelerating General-purpose Sparse Matrix-Matrix Multiplication with GPU Tensor Cores. ☆14 · Nov 13, 2025 · Updated 3 months ago
- ☆46 · Feb 16, 2026 · Updated 2 weeks ago
- An Open-Source RAG Workload Trace to Optimize RAG Serving Systems ☆35 · Nov 18, 2025 · Updated 3 months ago
- ☆25 · Sep 1, 2025 · Updated 6 months ago
- Memory Topology for GPUs ☆17 · Feb 13, 2026 · Updated 2 weeks ago
- Implementation of GraphReader paper: https://arxiv.org/abs/2406.14550 ☆13 · Oct 21, 2024 · Updated last year
- LONGAGENT: Scaling Language Models to 128k Context through Multi-Agent Collaboration ☆11 · Mar 11, 2024 · Updated last year
- Memory experiments with LLMs ☆11 · Mar 31, 2023 · Updated 2 years ago
- NVIDIA Networking NIC Configuration Operator For Kubernetes ☆14 · Updated this week
- Front end of a campus epidemic prevention and control system ☆14 · Dec 3, 2022 · Updated 3 years ago
- Utility functions/scripts for working with GPUs. ☆10 · Jul 5, 2021 · Updated 4 years ago
- Learning TileLang with 10 puzzles! ☆143 · Updated this week
- Python library to add support for embedding natural code in Python with shared program state. ☆23 · Jan 20, 2026 · Updated last month
- TopViewRS: Vision-Language Models as Top-View Spatial Reasoners (EMNLP 2024 Oral) ☆15 · Jun 14, 2025 · Updated 8 months ago
- Spatial Transformer Network (STN) provides attention to a particular region in an image by applying a transformation to the input image. T… ☆15 · Dec 21, 2020 · Updated 5 years ago
- ☆13 · Nov 2, 2022 · Updated 3 years ago
- Repository for studying for the Algoritmos 3 final exam ☆15 · Oct 23, 2018 · Updated 7 years ago
- The Zaychik Power Controller server ☆13 · Apr 13, 2024 · Updated last year
- The evaluation code for the paper "MoreHopQA: More Than Multi-hop Reasoning" ☆14 · Jun 21, 2024 · Updated last year
- ECCV' 2024. ☆14 · Sep 11, 2024 · Updated last year
- Simulating Distributed Training at Scale ☆14 · Sep 15, 2025 · Updated 5 months ago
- Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow" ☆77 · Oct 15, 2025 · Updated 4 months ago
- ☆15 · Feb 17, 2026 · Updated 2 weeks ago
- Light-weight Performance Variance Detection for Production-run Parallel Applications ☆16 · Aug 28, 2023 · Updated 2 years ago
- ☆19 · Jun 29, 2025 · Updated 8 months ago
- Mirror of http://gitlab.hpcrl.cse.ohio-state.edu/chong/ppopp19_ae, refactoring for understanding ☆15 · Oct 20, 2021 · Updated 4 years ago
- Personal study notes for the Computer Networks course at Beihang University ☆15 · Nov 10, 2020 · Updated 5 years ago
- A small RISC-V kernel written in C, tested on the SiFive Unmatched board. ☆16 · Aug 20, 2022 · Updated 3 years ago
- ☆16 · Feb 5, 2024 · Updated 2 years ago
- [EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization ☆21 · Feb 5, 2026 · Updated 3 weeks ago
- Vortex: A Flexible and Efficient Sparse Attention Framework ☆48 · Jan 21, 2026 · Updated last month
- ☆18 · Oct 31, 2025 · Updated 4 months ago
- ☆32 · Jul 2, 2025 · Updated 8 months ago
- Yuque Claude Code Plugin: one-click integration of Yuque AI capabilities ☆42 · Updated this week
- [TBD] "m4: A Learned Flow-level Network Simulator" by Chenning Li, Anton A. Zabreyko, Om Chabra, Arash Nasr-Esfahany, Kevin Zhao, Pratees… ☆16 · Nov 18, 2025 · Updated 3 months ago
- ☆18 · Oct 15, 2020 · Updated 5 years ago
- Sequence-level 1F1B schedule for LLMs. ☆19 · Jun 4, 2024 · Updated last year
- DeepSeek-V3.2-Exp DSA Warmup Lightning Indexer training operator based on tilelang ☆44 · Nov 19, 2025 · Updated 3 months ago
- The code for our paper "Neural Architecture Search as Program Transformation Exploration" ☆16 · Apr 28, 2021 · Updated 4 years ago