NVlabs / SASSI
Flexible GPGPU instrumentation
☆86Updated 5 years ago
Alternatives and similar repositories for SASSI:
Users that are interested in SASSI are comparing it to the libraries listed below
- Third party assembler and GEMM library for NVIDIA Kepler GPU☆81Updated 5 years ago
- assembler for NVIDIA FERMI. Imported from Google Code☆72Updated 10 years ago
- Polyhedral Parallel Code Generation (source repository: http://repo.or.cz/ppcg.git)☆124Updated 2 years ago
- ☆51Updated 5 years ago
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources☆110Updated 2 years ago
- A Benchmark Suite for Heterogeneous System Computation☆53Updated last month
- The SHOC Benchmark Suite☆251Updated 3 years ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆104Updated 7 years ago
- ☆236Updated last month
- ROCm - AMDGPU Compute Application Binary Interface☆41Updated 3 years ago
- CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.☆117Updated 2 years ago
- An Open Source Kepler GPU Assembler☆20Updated 8 years ago
- CUDAAdvisor: a GPU profiling tool☆48Updated 6 years ago
- Decuda and cudasm, the CUDA binary utilities package. Low-level tools for NVidia G80 GPUs.☆98Updated 14 years ago
- Performance Prediction Toolkit☆51Updated 3 months ago
- Machine Intelligence Shader Autogen. AMDGPU ML shader code generator. (previously iGEMMgen)☆34Updated 2 weeks ago
- ☆24Updated 5 years ago
- The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github…☆32Updated last week
- Stretching GPU performance for GEMMs and tensor contractions.☆234Updated 2 weeks ago
- Emulating DMA Engines on GPUs for Performance and Portability☆38Updated 9 years ago
- SST Structural Simulation Toolkit Parallel Discrete Event Core and Services☆141Updated this week
- ☆53Updated 5 years ago
- Chai☆43Updated last year
- ☆59Updated 5 months ago
- ☆61Updated 3 months ago
- Assembler for NVIDIA Volta and Turing GPUs☆214Updated 3 years ago
- TLB Benchmarks☆33Updated 7 years ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆132Updated this week
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆28Updated 6 months ago
- ROC profiler library. Profiling with perf-counters and derived metrics.☆138Updated this week