c3sr/comm_scope

NUMA-aware multi-CPU multi-GPU data transfer benchmarks

☆28

Alternatives and similar repositories for comm_scope

Users interested in comm_scope are comparing it to the repositories listed below.

cwpearson / llvmvm
LLVM Version Manager
☆11 · Apr 21, 2017 · Updated 9 years ago
ChenyangZhang-cs / iMLBench
iMLBench is a machine learning benchmark suite targeting CPU-GPU integrated architectures.
☆11 · May 29, 2021 · Updated 5 years ago
oneapi-src / ishmem
Intel® SHMEM - Device initiated shared memory based communication library
☆33 · Nov 12, 2025 · Updated 8 months ago
e-ago / hpgmg-cuda-async
GPUDirect Async implementation of HPGMG-FV CUDA
☆11 · May 11, 2018 · Updated 8 years ago
NVIDIA / grace-cpu-benchmarking-guide
Guides and examples to help achieve optimal performance on a NVIDIA Grace CPU
☆17 · Aug 9, 2024 · Updated last year
CSA-infra / RISCV-Scalable-Simulation-tutorial
☆15 · Feb 2, 2026 · Updated 5 months ago
radix-io / hands-on
Hands-on HPC I/O tutorial material
☆18 · Oct 9, 2025 · Updated 9 months ago
peaclab / HPAS
HPC Performance Anomaly Suite
☆22 · Jun 11, 2020 · Updated 6 years ago
AMReX-Codes / ATPESC-codes
Example codes for ATPESC
☆14 · Jul 31, 2025 · Updated 11 months ago
LangdalP / EPCC-OpenMP-micro-benchmarks
A fork of the EPCC OpenMP micro-benchmark suite with some improvements
☆12 · Apr 27, 2017 · Updated 9 years ago
csc-training / high-level-gpu-programming
CSC Training: High-Level GPU Programming
☆14 · Oct 16, 2025 · Updated 9 months ago
TUE-EE-ES / HalideAutoGPU
☆11 · Sep 14, 2020 · Updated 5 years ago
ingonyama-zk / fast-danksharding
Danksharding Builder with GPU acceleration
☆52 · Sep 10, 2023 · Updated 2 years ago
BullSequana / portails4
This repository contains an implementation for Portals4. Portals4 is a Network Programming Interface which allows high-performance networ…
☆14 · Sep 3, 2024 · Updated last year
Bruce-Lee-LY / cuda_back2back_hgemm
Use tensor core to calculate back-to-back HGEMM (half-precision general matrix multiplication) with MMA PTX instruction.
☆13 · Nov 3, 2023 · Updated 2 years ago
hashcloak / plonky2-merkle-trees
☆16 · Jan 5, 2024 · Updated 2 years ago
Cray / pe-scripts
Scripts for building libraries with Cray's PE
☆21 · Aug 31, 2021 · Updated 4 years ago
ScottKolo / suitesparse-matrix-collection-website
A web interface for the SuiteSparse Matrix Collection, formerly known as the University of Florida Sparse Matrix Collection
☆25 · Jun 5, 2025 · Updated last year
mpi-advance / locality_aware
Collective and Neighbor Collective Optimizations and Extensions
☆15 · Jul 14, 2026 · Updated last week
ROCm / rocSHMEM
[DEPRECATED] Moved to ROCm/rocm-systems repo
☆146 · Updated this week
slongle / GPU-Renderer
Offline renderer using CUDA
☆13 · Jun 8, 2020 · Updated 6 years ago
tud-zih-energy / x86_adapt
A Linux kernel module, that allows changing/toggling system parameters stored in MSR and PCI registers of x86 processors
☆16 · Mar 29, 2023 · Updated 3 years ago
maherharb / Autocomplete
Next word prediction based on N-gram language model
☆11 · Jan 11, 2015 · Updated 11 years ago
githwxi / XATSHOME
For hosting ATS3 and developing CodeDepot
☆18 · Jun 14, 2026 · Updated last month
josehu07 / cuckoo-hashing-CUDA
Parallel cuckoo hashing on GPUs with CUDA
☆12 · Sep 27, 2019 · Updated 6 years ago
IBM / mpitrace
Library for measuring communication in distributed-memory parallel applications that use the standard Message-Passing Interface (MPI)
☆23 · Sep 17, 2025 · Updated 10 months ago
pluto / circom-correctly-constrained
A reference on testing and constraining circom
☆21 · Oct 2, 2024 · Updated last year
JuliaConcurrent / Atomix.jl
☆23 · Jun 19, 2026 · Updated last month
rubber-duck-debug / xielu
A fast vectorized implementation of the XIELU activation function
☆21 · Oct 9, 2025 · Updated 9 months ago
pseXperiments / cuda-sumcheck
Experimental implementation of Sumcheck protocol using CUDA
☆23 · Nov 14, 2024 · Updated last year
EnigmaHuang / Saad_Book_ForTran
Some "Formula Translations" for Yousef Saad's book "Iterative Methods for Sparse Linear Systems (2nd Edition)"
☆13 · Jan 14, 2018 · Updated 8 years ago
Chair-for-Security-Engineering / ecmongpu
ECM Factorization on CUDA-GPUs
☆16 · Sep 29, 2020 · Updated 5 years ago
ROCm / rocprof-compute-viewer
☆61 · Updated this week
JeffBezanson / dataflow.jl
Introduction to dataflow analysis using Julia
☆14 · Oct 26, 2020 · Updated 5 years ago
j-levy / bwa-gasal2
BWA-MEM program accelerated with the GASAL2 library
☆19 · Sep 2, 2019 · Updated 6 years ago
uuudown / Tartan
Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite
☆72 · Sep 12, 2018 · Updated 7 years ago
ECP-copa / CoMD
Classical molecular dynamics proxy application.
☆31 · Jun 29, 2020 · Updated 6 years ago
VRGroupRWTH / mpi
Header-only C++20 wrapper for MPI 4.0.
☆16 · Oct 20, 2023 · Updated 2 years ago
ROCm / rocm-xio
A ROCm library for GPU-Initiated IO. This provides support for initiating IO from a ROCm-capable GPU against a range of targets including…
☆52 · Jul 10, 2026 · Updated last week