gt-crnch-rg / ucx-tutorial-hot-interconnectsLinks
☆25Updated 3 years ago
Alternatives and similar repositories for ucx-tutorial-hot-interconnects
Users that are interested in ucx-tutorial-hot-interconnects are comparing it to the libraries listed below
Sorting:
- example code for using DC QP for providing RDMA READ and WRITE operations to remote GPU memory☆142Updated last year
- Repository for MLCommons Chakra schema and tools☆125Updated last month
- ☆182Updated 2 weeks ago
- A LogGOPS (LogP, LogGP, LogGPS) Simulator and Simulation Framework☆13Updated last year
- GPUDirect example☆60Updated 3 years ago
- TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches☆74Updated 2 years ago
- ☆18Updated 6 months ago
- NCCL Profiling Kit☆143Updated last year
- Demystifying Datapath Accelerator Enhanced Off-path SmartNIC [ICNP24]☆42Updated 9 months ago
- rFaaS: a high-performance FaaS platform with RDMA acceleration for low-latency invocations.☆53Updated 2 months ago
- ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale☆421Updated last week
- GPUDirect Async support for IB Verbs☆130Updated 2 years ago
- ☆40Updated last year
- ☆52Updated 2 months ago
- ☆24Updated 3 years ago
- Benchmark Test Suite for RDMA Networks☆56Updated 2 years ago
- Magnum IO community repo☆98Updated 3 weeks ago
- ATLAHS: An Application-centric Network Simulator Toolchain for AI, HPC, and Distributed Storage☆27Updated this week
- Microsoft Collective Communication Library☆360Updated last year
- This serves as a repository for reproducibility of the SC21 paper "In-Depth Analyses of Unified Virtual Memory System for GPU Accelerated…☆35Updated last year
- ☆16Updated 6 years ago
- Code samples related to Intel(R) AMX☆39Updated last year
- ☆76Updated 4 years ago
- REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…☆100Updated 2 years ago
- Arbitrary offloads for RDMA NICs☆97Updated 3 years ago
- Synthesizer for optimal collective communication algorithms☆116Updated last year
- [USENIX ATC 2021] Exploring the Design Space of Page Management for Multi-Tiered Memory Systems☆47Updated 3 years ago
- Benchmark Suite for RDMA Performance Isolation☆40Updated 2 years ago
- RDMA exmaple☆218Updated 3 years ago
- ☆71Updated 2 weeks ago