nihui / vkpeak
A tool which profiles Vulkan devices to find their peak capacities
☆124Updated this week
Alternatives and similar repositories for vkpeak
Users who are interested in vkpeak are comparing it to the libraries listed below
- ☆18Updated 4 years ago
- Benchmark your NCNN models on 3DS (or crash)☆10Updated last year
- Handy tools & graphics API abstraction for blazing fast prototyping☆9Updated last year
- prebuild package for cross compiling riscv☆18Updated 3 years ago
- A micro Vulkan compute pipeline and a collection of benchmarking compute shaders☆236Updated 2 months ago
- Detect CPU features with single-file☆393Updated 3 weeks ago
- chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.☆277Updated this week
- A tool which profiles OpenCL devices to find their peak capacities☆449Updated last week
- ☆14Updated 2 months ago
- rocWMMA☆114Updated this week
- A small OpenCL benchmark program to measure peak GPU/CPU performance.☆212Updated 2 weeks ago
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)☆401Updated 4 months ago
- ☆174Updated last week
- ☆141Updated last week
- BLIS fork with kernels for Apple M1. (Perhaps) The first open-source BLAS with Apple Matrix Coprocessor support.☆35Updated 2 years ago
- ☆135Updated this week
- Marek's approach to building AMD GPU drivers for driver development☆25Updated 2 months ago
- ROCm's Thunk Interface☆91Updated 2 months ago
- A converter for llama2.c legacy models to ncnn models.☆87Updated last year
- AMD's graph optimization engine.☆220Updated this week
- A stub OpenCL library that dynamically dlopen/dlsyms OpenCL implementations at runtime based on environment variables. Will be useful when…☆73Updated last year
- Stretching GPU performance for GEMMs and tensor contractions.☆242Updated last week
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆39Updated 3 years ago
- OpenAI Triton backend for Intel® GPUs☆187Updated this week
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆83Updated this week
- ROCm Device Libraries☆97Updated last year
- Tencent NCNN with added CUDA support☆69Updated 4 years ago
- A profiler to disclose and quantify hardware features on GPUs.☆169Updated 3 years ago
- Run RWKV inference on NCNN☆48Updated 8 months ago
- Encapsulate the frequently used AVX instructions as independent modules to reduce repeated development workload.☆121Updated last year