nihui / vkpeak
A tool which profiles Vulkan devices to find their peak capacities
☆100 Updated 2 months ago
Related projects
Alternatives and complementary repositories for vkpeak
- Detect CPU features with single-file ☆297 Updated 3 weeks ago
- ☆17 Updated 3 years ago
- Handy tools & graphics API abstraction for blazing fast prototyping ☆9 Updated 10 months ago
- A micro Vulkan compute pipeline and a collection of benchmarking compute shaders ☆227 Updated 3 months ago
- ☆37 Updated last year
- Infer RWKV on NCNN ☆48 Updated 2 months ago
- A tool which profiles OpenCL devices to find their peak capacities ☆413 Updated 2 weeks ago
- prebuild package for cross compiling riscv ☆18 Updated 2 years ago
- chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs. ☆227 Updated this week
- A profiler to disclose and quantify hardware features on GPUs. ☆162 Updated 2 years ago
- A small OpenCL benchmark program to measure peak GPU/CPU performance. ☆164 Updated this week
- Benchmark your NCNN models on 3DS (or crash) ☆9 Updated 7 months ago
- ☆57 Updated 2 years ago
- rocWMMA ☆92 Updated this week
- AMD's graph optimization engine. ☆186 Updated this week
- A converter for llama2.c legacy models to ncnn models. ☆82 Updated 11 months ago
- ROCm's Thunk Interface ☆83 Updated 2 weeks ago
- Call ncnn from Fortran ☆14 Updated last year
- ☆136 Updated last week
- Tencent NCNN with added CUDA support ☆67 Updated 3 years ago
- Fork of https://source.codeaurora.org/quic/hexagon_nn/nnlib ☆54 Updated last year
- rocDecode is a high performance video decode SDK for AMD hardware ☆13 Updated this week
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP) ☆363 Updated 3 months ago
- BLIS fork with kernels for Apple M1. (Perhaps) The first open-source BLAS with Apple Matrix Coprocessor support. ☆32 Updated last year
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator. ☆33 Updated 3 years ago
- ☆103 Updated this week
- ☆72 Updated 2 weeks ago
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs ☆75 Updated last week
- OpenAI Triton backend for Intel® GPUs ☆143 Updated this week