nihui / vkpeakLinks
A tool which profiles Vulkan devices to find their peak capacities
☆148Updated 2 weeks ago
Alternatives and similar repositories for vkpeak
Users that are interested in vkpeak are comparing it to the libraries listed below
Sorting:
- Detect CPU features with single-file☆432Updated last week
- Benchmark your NCNN models on 3DS(or crash)☆10Updated last year
- ncnn android vkpeak☆24Updated last month
- A small OpenCL benchmark program to measure peak GPU/CPU performance.☆262Updated 2 weeks ago
- Derived from Nemes' gpuperftests☆33Updated last year
- A tool which profiles OpenCL devices to find their peak capacities☆474Updated 5 months ago
- A micro Vulkan compute pipeline and a collection of benchmarking compute shaders☆255Updated 7 months ago
- A profiler to disclose and quantify hardware features on GPUs.☆174Updated 3 years ago
- LLM inference in C/C++☆20Updated last month
- prebuild package for cross compiling riscv☆17Updated 3 years ago
- Implementation of OpenCL 3.0 on Vulkan☆413Updated 2 weeks ago
- AMD's graph optimization engine.☆266Updated this week
- chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.☆305Updated this week
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆44Updated 4 years ago
- BLIS fork with kernels for Apple M1. (Perhaps) The first open-source BLAS with Apple Matrix Coprocessor support.☆36Updated 2 years ago
- ☆183Updated 2 months ago
- A stub opecl library that dynamically dlopen/dlsyms opencl implementations at runtime based on environment variables. Will be useful when…☆74Updated last year
- Infere RWKV on NCNN☆49Updated last year
- Encapsulate the frequently used AVX instructions as independent modules to reduce repeated development workload.☆127Updated last year
- ROCm's Thunk Interface☆91Updated 8 months ago
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)☆427Updated 10 months ago
- ☆28Updated 4 years ago
- ☆143Updated last week
- The OpenCL Conformance Tests☆218Updated this week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆269Updated last week
- AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/Ope…☆68Updated last week
- ☆154Updated this week
- Marek's approach to building AMD GPU drivers for driver development☆28Updated last month
- A converter for llama2.c legacy models to ncnn models.☆80Updated last year
- Tencent NCNN with added CUDA support☆71Updated 4 years ago