yszheda / wikiLinks
my wiki tips
☆44Updated 8 years ago
Alternatives and similar repositories for wiki
Users that are interested in wiki are comparing it to the libraries listed below
Sorting:
- arm neon 相关文档和指令意义☆243Updated 6 years ago
- A stub opecl library that dynamically dlopen/dlsyms opencl implementations at runtime based on environment variables. Will be useful when…☆73Updated last year
- 作为对《Heterogeneous Computing with OpenCL 2.0》英文版的中文翻译。☆139Updated 4 years ago
- Fork of https://source.codeaurora.org/quic/hexagon_nn/nnlib☆58Updated 2 years ago
- Makes ARM NEON documentation accessible (with examples)☆400Updated last year
- Portable (POSIX/Windows/Emscripten) thread pool for C/C++☆373Updated last year
- mperf是一个面向移动/嵌入式平台的算子性能调优工具箱☆186Updated last year
- Learn OpenCL step by step.☆137Updated 2 years ago
- ☆61Updated 3 years ago
- Automatically exported from code.google.com/p/opencl-book-samples☆166Updated 5 years ago
- Khronos OpenVX Tutorial Material☆245Updated 3 years ago
- clone of https://code.google.com/p/opencl-book-samples (there's an official repo here https://github.com/bgaster/opencl-book-samples)☆45Updated 12 years ago
- there are guide examples for mobile cv algorithms optimization.☆28Updated 2 years ago
- The platform independent header allowing to compile any C/C++ code containing ARM NEON intrinsic functions for x86 target systems using S…☆466Updated 2 months ago
- symmetric int8 gemm☆66Updated 5 years ago
- VeriSilicon Tensor Interface Module☆235Updated 6 months ago
- pdf☆91Updated 7 years ago
- Smooth C/C++ Building Experience when using CMake☆37Updated this week
- a c++/cuda template library for tensor lazy evaluation☆161Updated 2 years ago
- OpenVX sample implementation☆143Updated last year
- how to design cpu gemm on x86 with avx256, that can beat openblas.☆70Updated 6 years ago
- Tests for ARM/Neon instructions, useful for compilers and simulators.☆40Updated 8 years ago
- ☆156Updated 4 months ago
- Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )☆60Updated 3 months ago
- Optimizing Mobile Deep Learning on ARM GPU with TVM☆181Updated 6 years ago
- The CMake version of cuda_by_example☆148Updated 4 years ago
- benchmark for embededded-ai deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.☆204Updated 4 years ago
- ☆263Updated 7 years ago
- Automatically exported from code.google.com/p/sse2neon☆286Updated 4 years ago
- Arm neon optimization practice☆393Updated 4 years ago