ParCoreLab/CPU-Free-model

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ParCoreLab/CPU-Free-model)

ParCoreLab / CPU-Free-model

Source code for the CPU-Free model - a fully autonomous execution model for multi-GPU applications that completely excludes the involvement of the CPU beyond the initial kernel launch.

☆21

Alternatives and similar repositories for CPU-Free-model

Users that are interested in CPU-Free-model are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ParCoreLab / ComScribe
View on GitHub
ComScribe is a tool to identify communication among all GPU-GPU and CPU-GPU pairs in a single-node multi-GPU system.
☆27Jul 6, 2023Updated 3 years ago
ParCoreLab / ReuseTracker
View on GitHub
A fast and accurate reuse distance analyzer for multi-threaded applications. It leverages existing hardware features in commodity CPUs.
☆21Feb 3, 2023Updated 3 years ago
mabdullahsoyturk / HPC-Paper-Notes
View on GitHub
My notes on various HPC papers.
☆27Jan 7, 2023Updated 3 years ago
e-ago / hpgmg-cuda-async
View on GitHub
GPUDirect Async implementation of HPGMG-FV CUDA
☆11May 11, 2018Updated 8 years ago
YukeWang96 / MGG_OSDI23
View on GitHub
Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult…
☆40Mar 17, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
hpcac / 2023-APAC-HPC-AI
View on GitHub
☆12Sep 28, 2023Updated 2 years ago
SU-HPC / GOSH
View on GitHub
An ultra-fast, GPU-based large graph embedding algorithm utilizing a novel coarsening algorithm requiring not more than a single GPU.
☆24Jan 3, 2022Updated 4 years ago
leefige / radik
View on GitHub
Scalable radix top-k selection on GPUs.
☆23Jan 27, 2025Updated last year
HomeOfVapourSynthEvolution / VapourSynth-ReadMpls
View on GitHub
ReadMpls filter for VapourSynth
☆12Oct 5, 2021Updated 4 years ago
facebookresearch / FAMBench
View on GitHub
Benchmarks to capture important workloads.
☆32Apr 1, 2026Updated 3 months ago
IFeelBloated / Plum
View on GitHub
☆11Mar 27, 2021Updated 5 years ago
closest-git / GSS
View on GitHub
best CPU/GPU sparse solver for large sparse matrices
☆21Oct 5, 2021Updated 4 years ago
linnanwang / BLASX
View on GitHub
a heterogeneous multiGPU level-3 BLAS library
☆46Dec 9, 2019Updated 6 years ago
ECP-ExaGraph / grappolo
View on GitHub
OpenMP implementation of Graph Community Detection, with a number of parallel heuristics/approximate computing techniques
☆23Jun 15, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
gpudirect / gdasync
View on GitHub
GPUDirect Async suite
☆16Dec 5, 2018Updated 7 years ago
notini / csr-formatter
View on GitHub
C++ package to store Matrix Market (.mtx) file format sparse matrices in Compressed Row Storage (CSR) format.
☆17Oct 16, 2019Updated 6 years ago
pfnet-research / allreduce-proto
View on GitHub
A prototype implementation of AllReduce collective communication routine.
☆19Sep 27, 2018Updated 7 years ago
sorayuki / TawawaFilter
View on GitHub
AviSynth filter to make "Tawawa of Monday" in blue color
☆10Oct 20, 2016Updated 9 years ago
ROCm / iris
View on GitHub
AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming
☆193Updated this week
AlexeyPechnikov / ParaView-Blender-AR
View on GitHub
iOS/iPadOS/Android assure the best AR
☆20Feb 13, 2024Updated 2 years ago
GoFEM / pyGoFEM
View on GitHub
A python front-end for the GoFEM modelling and inversion code
☆16Jun 15, 2026Updated last month
apuaaChen / vectorSparse
View on GitHub
☆32Aug 24, 2022Updated 3 years ago
NVlabs / Parsimony-CGO23
View on GitHub
☆15Jan 11, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
yoshiya-usui / TRACMT
View on GitHub
Robust transfer function analysis code for magnetotellurics
☆15Apr 14, 2026Updated 3 months ago
NVIDIA / multi-gpu-programming-models
View on GitHub
Examples demonstrating available options to program multiple GPUs in a single node or a cluster
☆908Sep 26, 2025Updated 9 months ago
hongbo-yao / BayesMTGDS
View on GitHub
Trans-dimensional Bayesian joint inversion of magnetotelluric and geomagnetic depth sounding responses to constrain mantle electrical dis…
☆19Apr 1, 2024Updated 2 years ago
poojahira / spmv-cuda
View on GitHub
Implementation and analysis of five different GPU based SPMV algorithms in CUDA
☆39Feb 5, 2019Updated 7 years ago
spcl / sten
View on GitHub
Sparsity support for PyTorch
☆39Mar 22, 2025Updated last year
MrSidims / PytorchExplorer
View on GitHub
An interactive web-based tool for exploring intermediate representations of PyTorch and Triton models
☆49Jan 23, 2026Updated 5 months ago
dubhatervapoursynth / D2VWitch
View on GitHub
Cross-platform D2V creator
☆39Sep 25, 2023Updated 2 years ago
kice / vs_mxDnCNN
View on GitHub
A mxnet implement of the paper "Beyond a Gaussian Denoiser : Residual Learning of Deep CNN for Image Denoising" for VapourSynth
☆14Dec 20, 2017Updated 8 years ago
UofT-EcoSystem / Tempo
View on GitHub
Memory footprint reduction for transformer models
☆11Jan 24, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
pnnl / E4D
View on GitHub
☆20Dec 28, 2021Updated 4 years ago
mysteryx93 / VapourSynthViewer.NET
View on GitHub
VapourSynth Video Script Viewer API for .NET
☆14Oct 12, 2019Updated 6 years ago
jellyterra / spacemit-k1-archlinux
View on GitHub
Arch Linux RISC-V images for Banana Pi F3 with SpacemiT K1 / M1 / X60.
☆13Dec 21, 2025Updated 6 months ago
NVlabs / ptxmemorymodel
View on GitHub
☆77May 29, 2019Updated 7 years ago
AkarinVS / vapoursynth-plugin
View on GitHub
My experimental VapourSynth plugin: (1) an enhanced LLVM-based std.Expr (aka lexpr), Select, PropExpr, Text and Tmpl. (2) DLISR. (3) DLVF…
☆42Nov 18, 2023Updated 2 years ago
justine18 / performance_experiment
View on GitHub
Performance experiment - Pyomo vs JuMP
☆12Aug 3, 2023Updated 2 years ago
IaroslavElistratov / triton-autodiff
View on GitHub
☆19Nov 11, 2025Updated 8 months ago