decodecudabinary / Decoding-CUDA-Binary

☆52

Alternatives and similar repositories for Decoding-CUDA-Binary

Users who are interested in Decoding-CUDA-Binary are comparing it to the repositories listed below.

hkust-adsl / gass
☆38 Updated 3 years ago
PAA-NCIC / PPoPP2017_artifact
Third party assembler and GEMM library for NVIDIA Kepler GPU
☆81 Updated 5 years ago
NVlabs / ptxmemorymodel
☆64 Updated 6 years ago
apc-llc / nvcc-llvm-ir
Enabling on-the-fly manipulations with LLVM IR code of CUDA sources
☆112 Updated 3 months ago
sderek / CUDAAdvisor
CUDAAdvisor: a GPU profiling tool
☆49 Updated 6 years ago
spcl / open-earth-compiler
development repository for the open earth compiler
☆80 Updated 4 years ago
lanl / PPT
Performance Prediction Toolkit
☆52 Updated 7 months ago
Jokeren / GPA
GPU Performance Advisor
☆65 Updated 3 years ago
sjfeng1999 / gpu-arch-microbenchmark
Dissecting NVIDIA GPU Architecture
☆103 Updated 3 years ago
daadaada / gas
☆45 Updated 4 years ago
hyqneuron / asfermi
assembler for NVIDIA FERMI. Imported from Google Code
☆72 Updated 10 years ago
NVlabs / NVBit
☆270 Updated last month
ROCm / rocMLIR
☆148 Updated this week
Meinersbur / ppcg
Polyhedral Parallel Code Generation (source repository: http://repo.or.cz/ppcg.git)
☆127 Updated 3 years ago
NVlabs / SASSI
Flexible GPGPU instrumentation
☆88 Updated 5 years ago
ekondis / gpumembench
A GPU benchmark suite for assessing on-chip GPU memory bandwidth
☆106 Updated 7 years ago
GVProf / GVProf
GVProf: A Value Profiler for GPU-based Clusters
☆51 Updated last year
sunlex0717 / DissectingTensorCores
☆106 Updated last year
c3sr / tcu_scope
☆51 Updated 6 years ago
NUCAR-DEV / Hetero-Mark
A Benchmark Suite for Heterogeneous System Computation
☆53 Updated 5 months ago
accel-sim / gpu-app-collection
A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions.
☆70 Updated 2 weeks ago
uuudown / Tartan
Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite
☆66 Updated 6 years ago
kumasento / polymer
Bridging polyhedral analysis tools to the MLIR framework
☆116 Updated last year
iml130 / mlir-emitc
Conversions to MLIR EmitC
☆128 Updated 7 months ago
gcoe-dresden / cuda-gpu-tlb
TLB Benchmarks
☆34 Updated 7 years ago
mattsinc / heterosync
HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs
☆30 Updated 10 months ago
codyjrivera / tsm2x-imp
Implementation of TSM2L and TSM2R -- High-Performance Tall-and-Skinny Matrix-Matrix Multiplication Algorithms for CUDA
☆35 Updated 5 years ago
HAWAIILAB / cuda-flux
CUDA Flux is a profiler for GPU applications which reports the basic block execution frequencies of compute kernels
☆32 Updated 4 years ago
wzh99 / relay-mlir
An MLIR-based toy DL compiler for TVM Relay.
☆58 Updated 2 years ago
vortexgpgpu / NVPTX-SPIRV-Translator
A translator from NVPTX to SPIR-V, modified from the LLVM-SPIR-V Translator.
☆40 Updated 3 years ago