gpudirect / libgdsync
GPUDirect Async support for IB Verbs
☆90Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for libgdsync
- GPUDirect example☆57Updated 3 years ago
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric …☆63Updated this week
- ☆22Updated 3 years ago
- Provides a set of benchmarks that can be used to measure the memory bandwidth performance of CPU's☆80Updated 7 months ago
- Magnum IO community repo☆80Updated 5 months ago
- ROC_SHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆39Updated last year
- NCCL Profiling Kit☆112Updated 4 months ago
- oneAPI Collective Communications Library (oneCCL)☆206Updated this week
- Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite☆60Updated 6 years ago
- OFI Programmer's Guide☆49Updated last year
- FROZEN: the master branch has merged with the libfabric git repo☆31Updated 6 years ago
- Linux Cross-Memory Attach☆88Updated 2 months ago
- Pytorch process group third-party plugin for UCC☆20Updated 7 months ago
- CUPTI GPU Profiler☆37Updated 5 years ago
- Simple message passing library☆22Updated 6 years ago
- ☆36Updated 5 months ago
- tools to create performance and roofline plots from measured data☆58Updated 10 years ago
- example code for using DC QP for providing RDMA READ and WRITE operations to remote GPU memory☆104Updated 3 months ago
- Portals is a low-level network API for high-performance networking on high-performance computing systems developed by Sandia National Lab…☆34Updated 2 months ago
- verbs profiling library☆20Updated last year
- RDMA and SHARP plugins for nccl library☆162Updated last week
- Flexible GPGPU instrumentation☆86Updated 5 years ago
- A low-overhead tool to periodically collect system-wide hardware performance counters on Intel64 systems.☆31Updated 2 years ago
- A hierarchical collective communications library with portable optimizations☆23Updated 4 months ago
- Unified Collective Communication Library☆207Updated last week
- A kernel module to enable RDMA transfers to/from IO, PFN and DAX mapped memory☆10Updated 9 years ago
- ☆47Updated 5 years ago
- OpenSHMEM Reference Implementation over UCX for Specification 1.4 and up☆33Updated last year
- Parallel Memory Bandwidth Measurement / Benchmark Tool☆104Updated 2 years ago
- ☆68Updated 8 years ago