NMSU-PEARL / GPUs-EnergyLinks

[CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs

☆15

Alternatives and similar repositories for GPUs-Energy

Users that are interested in GPUs-Energy are comparing it to the libraries listed below

Sorting:

Lin-Mao / DrGPUM
A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.
☆25Updated 9 months ago
lanl / PPT
Performance Prediction Toolkit
☆52Updated 7 months ago
spcl / daceml
A Data-Centric Compiler for Machine Learning
☆84Updated last year
Jokeren / GPA
GPU Performance Advisor
☆65Updated 2 years ago
hpdps-group / coccl
COCCL: Compression and precision co-aware collective communication library
☆24Updated 4 months ago
szcompressor / cuSZp
Fast GPU error-bounded lossy compressor for floating-point data.
☆41Updated 6 months ago
ekondis / gpuroofperf-toolkit
A GPU performance prediction toolkit for CUDA programs
☆17Updated 6 years ago
RIKEN-RCCS / hpl-ai
An HPL-AI implementation for Fugaku
☆21Updated 4 years ago
c3sr / comm_scope
NUMA-aware multi-CPU multi-GPU data transfer benchmarks
☆23Updated last year
pnnl / COMET
☆40Updated 2 weeks ago
NGIOproject / PMTutorial
Slides and exercises for persistent memory programming tutorial
☆13Updated 2 years ago
CoffeeBeforeArch / nvbit_tools
☆13Updated 4 years ago
microsoft / dist-ir
An IR for efficiently simulating distributed ML computation.
☆28Updated last year
GVProf / GVProf
GVProf: A Value Profiler for GPU-based Clusters
☆51Updated last year
brightlaboratory / polydl
☆12Updated 4 years ago
chai-benchmarks / chai
Chai
☆44Updated last year
olcf / NVIDIA-tensor-core-examples
☆18Updated 5 years ago
illinois-impact / klap
A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches
☆15Updated 6 years ago
HAWAIILAB / cuda-flux
CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels
☆32Updated 4 years ago
casys-kaist / EnvPipe
☆25Updated last year
jiazhihao / attention_superoptimizer
An Attention Superoptimizer
☆22Updated 5 months ago
ParCoreLab / Snoopie
Multi-GPU communication profiler and visualizer
☆31Updated last year
spcl / mlir-dace
Data-Centric MLIR dialect
☆42Updated last year
oresths / tSparse
A GPU algorithm for sparse matrix-matrix multiplication
☆71Updated 4 years ago
coreyjadams / CosmicTagger
Cosmic Tagging Network for Neutrino Physics
☆13Updated last year
cyanguwa / nersc-roofline
☆45Updated 4 years ago
uuudown / Tartan
Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite
☆66Updated 6 years ago
chhzh123 / Krill
An efficient concurrent graph processing system
☆46Updated 3 years ago
rapidsai / nvgraph
☆32Updated 4 years ago
KernelTuner / kernel_launcher
Using C++ magic to launch/capture CUDA kernels and tune them with Kernel Tuner
☆20Updated last year