NVlabs / SASSI
Flexible GPGPU instrumentation
☆85 Updated 4 years ago
Related projects:
- assembler for NVIDIA FERMI. Imported from Google Code ☆68 Updated 9 years ago
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources ☆97 Updated last year
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth ☆96 Updated 7 years ago
- A Benchmark Suite for Heterogeneous System Computation ☆52 Updated last week
- Third party assembler and GEMM library for NVIDIA Kepler GPU ☆76 Updated 4 years ago
- The SHOC Benchmark Suite ☆243 Updated 2 years ago
- Polyhedral Parallel Code Generation (source repository: http://repo.or.cz/ppcg.git) ☆116 Updated 2 years ago
- ☆48 Updated 4 years ago
- ☆44 Updated 5 years ago
- ROCm - AMDGPU Compute Application Binary Interface ☆40 Updated 2 years ago
- Chai ☆41 Updated 9 months ago
- Performance Prediction Toolkit ☆51 Updated 2 years ago
- MIOpenGEMM is now deprecated ☆61 Updated last year
- ☆218 Updated 3 weeks ago
- CudaPAD is a PTX/SASS viewer for NVIDIA CUDA kernels and provides an on-the-fly view of the assembly. ☆107 Updated last year
- Implement asm gemm on vega64 for 4096x4096 fp32 matrix ☆19 Updated 4 years ago
- Machine Intelligence Shader Autogen. AMDGPU ML shader code generator. (previously iGEMMgen) ☆33 Updated 3 months ago
- CUDAAdvisor: a GPU profiling tool ☆48 Updated 6 years ago
- ☆58 Updated 2 years ago
- portDNN is a library implementing neural network algorithms written using SYCL ☆106 Updated 3 months ago
- Stretching GPU performance for GEMMs and tensor contractions. ☆213 Updated this week
- HCC Sample Applications ☆13 Updated 7 years ago
- ☆39 Updated 3 years ago
- An Open Source Kepler GPU Assembler ☆19 Updated 7 years ago
- ROC profiler library. Profiling with perf-counters and derived metrics. ☆124 Updated this week
- development repository for the open earth compiler ☆74 Updated 3 years ago
- A benchmarking suite for heterogeneous systems. The primary goal of this project is to improve and update aspects of existing benchmarkin… ☆37 Updated 6 months ago
- CLTune: An automatic OpenCL & CUDA kernel tuner ☆167 Updated last year
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs ☆27 Updated last year
- Decuda and cudasm, the CUDA binary utilities package. Low-level tools for NVIDIA G80 GPUs. ☆94 Updated 14 years ago