willhua / QualcommOpenCLSDKNote
The note of Qualcomm OpenCL SDK
☆25Updated 5 years ago
Related projects: ⓘ
- Qualcomm Hexagon NN Offload Framework☆40Updated 3 years ago
- A stub opecl library that dynamically dlopen/dlsyms opencl implementations at runtime based on environment variables. Will be useful when…☆67Updated 6 months ago
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.☆81Updated 7 months ago
- Fork of https://source.codeaurora.org/quic/hexagon_nn/nnlib☆53Updated last year
- assembler for NVIDIA FERMI. Imported from Google Code☆68Updated 9 years ago
- Tests for ARM/Neon instructions, useful for compilers and simulators.☆34Updated 7 years ago
- ☆20Updated this week
- ☆39Updated 3 years ago
- AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/Ope…☆53Updated this week
- how to design cpu gemm on x86 with avx256, that can beat openblas.☆64Updated 5 years ago
- ☆17Updated 4 years ago
- Third party assembler and GEMM library for NVIDIA Kepler GPU☆76Updated 4 years ago
- An extension library of WMMA API (Tensor Core API)☆81Updated 2 months ago
- portDNN is a library implementing neural network algorithms written using SYCL☆106Updated 4 months ago
- ☆34Updated 3 years ago
- ☆48Updated 4 years ago
- CUDA PTX-ISA Document 中文翻译版☆23Updated 6 months ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆96Updated 7 years ago
- Learn OpenCL step by step.☆127Updated 2 years ago
- The OpenCL Conformance Tests☆184Updated this week
- mperf是一个面向移动/嵌入式平台的算子性能调优工具箱☆169Updated last year
- Subpart source code of of deepcore v0.7☆27Updated 4 years ago
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆74Updated last year
- flexible-gemm conv of deepcore☆17Updated 4 years ago
- Winograd-based convolution implementation in OpenCL☆27Updated 7 years ago
- ☆24Updated this week
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆33Updated 2 years ago
- OpenVX API and Extension Registry.☆45Updated 4 months ago
- Machine Intelligence Shader Autogen. AMDGPU ML shader code generator. (previously iGEMMgen)☆33Updated 3 months ago
- Sparse-dense matrix-matrix multiplication on GPUs☆14Updated 5 years ago