willhua / QualcommOpenCLSDKNote
The note of Qualcomm OpenCL SDK
☆34Updated 6 years ago
Alternatives and similar repositories for QualcommOpenCLSDKNote:
Users who are interested in QualcommOpenCLSDKNote are comparing it to the libraries listed below
- mperf is an operator performance tuning toolbox for mobile/embedded platforms☆182Updated last year
- A stub OpenCL library that dynamically dlopen/dlsyms OpenCL implementations at runtime based on environment variables. Will be useful when…☆72Updated last year
- ☆39Updated 3 years ago
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.☆84Updated last year
- BGHT: High-performance static GPU hash tables.☆63Updated 2 weeks ago
- ☆8Updated last year
- clone of https://code.google.com/p/opencl-book-samples (there's an official repo here https://github.com/bgaster/opencl-book-samples)☆44Updated 12 years ago
- Qualcomm Hexagon NN Offload Framework☆42Updated 4 years ago
- The OpenCL Conformance Tests☆201Updated last week
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆104Updated 7 years ago
- Fork of https://source.codeaurora.org/quic/hexagon_nn/nnlib☆57Updated 2 years ago
- Common libraries for PPL projects☆29Updated last month
- flexible-gemm conv of deepcore☆17Updated 5 years ago
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆82Updated 2 years ago
- A Chinese translation of the English edition of "Heterogeneous Computing with OpenCL 2.0".☆133Updated 4 years ago
- Automatically exported from code.google.com/p/opencl-book-samples☆165Updated 5 years ago
- Third party assembler and GEMM library for NVIDIA Kepler GPU☆81Updated 5 years ago
- Handy tools & graphics API abstraction for blazing fast prototyping☆9Updated last year
- Learn OpenCL step by step.☆135Updated 2 years ago
- AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/Ope…☆61Updated this week
- ☆109Updated last year
- ☆124Updated 12 years ago
- portDNN is a library implementing neural network algorithms written using SYCL☆113Updated 11 months ago
- An extension library of WMMA API (Tensor Core API)☆96Updated 9 months ago
- Example of how to use CUDA with CMake >= 3.8☆69Updated last year
- Chinese translation of the CUDA PTX-ISA document☆38Updated last month
- Partial source code of deepcore v0.7☆27Updated 4 years ago
- CuPBoP-AMD is a CUDA translator that translates CUDA programs at NVVM IR level to HIP-compatible IR that can run on AMD GPUs.☆36Updated last year
- pdf☆89Updated 6 years ago
- How to design a CPU GEMM on x86 with AVX256 that can beat OpenBLAS.☆70Updated 6 years ago