gt-crnch-rg / ucx-tutorial-hot-interconnectsLinks
☆26Updated 3 years ago
Alternatives and similar repositories for ucx-tutorial-hot-interconnects
Users that are interested in ucx-tutorial-hot-interconnects are comparing it to the libraries listed below
Sorting:
- example code for using DC QP for providing RDMA READ and WRITE operations to remote GPU memory☆152Updated last year
- ☆22Updated 10 months ago
- GPUDirect example☆60Updated 4 years ago
- ☆42Updated last month
- Demystifying Datapath Accelerator Enhanced Off-path SmartNIC [ICNP24]☆54Updated last year
- Repository for MLCommons Chakra schema and tools☆146Updated 2 months ago
- ☆210Updated last month
- Magnum IO community repo☆109Updated last month
- ☆41Updated 2 years ago
- TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches☆79Updated 2 years ago
- NCCL Profiling Kit☆150Updated last year
- GPUDirect Async support for IB Verbs☆134Updated 3 years ago
- ☆25Updated 3 years ago
- Benchmark Test Suite for RDMA Networks☆58Updated 2 years ago
- ☆16Updated 6 years ago
- Benchmark Suite for RDMA Performance Isolation☆40Updated 2 years ago
- This serves as a repository for reproducibility of the SC21 paper "In-Depth Analyses of Unified Virtual Memory System for GPU Accelerated…☆38Updated 2 years ago
- Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite☆68Updated 7 years ago
- LineFS: Efficient SmartNIC Offload of a Distributed File System with Pipeline Parallelism☆89Updated 4 years ago
- RDMA exmaple☆231Updated 3 years ago
- Synthesizer for optimal collective communication algorithms☆123Updated last year
- ☆32Updated 5 years ago
- A collection of tools, code, and documentation to understand the host network on real server hardware.☆44Updated last year
- ☆40Updated 4 years ago
- [USENIX ATC 2021] Exploring the Design Space of Page Management for Multi-Tiered Memory Systems☆48Updated 3 years ago
- Repository for MLCommons Chakra schema and tools☆39Updated 2 years ago
- ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale☆505Updated last week
- ☆69Updated 3 years ago
- ATLAHS: An Application-centric Network Simulator Toolchain for AI, HPC, and Distributed Storage☆60Updated last month
- Pond: CXL-Based Memory Pooling Systems for Cloud Platforms (ASPLOS'23)☆216Updated last year