asvkarthick / Hexagon_DSP_programming
Repository to learn Hexagon DSP and HVX Programming
☆25 Updated 6 years ago
Alternatives and similar repositories for Hexagon_DSP_programming
Users that are interested in Hexagon_DSP_programming are comparing it to the libraries listed below
- mperf is an operator performance tuning toolbox for mobile/embedded platforms ☆192 Updated 2 years ago
- arm-neon ☆92 Updated last year
- Qualcomm Hexagon NN Offload Framework ☆45 Updated 5 years ago
- ☆97 Updated 4 years ago
- Fork of https://source.codeaurora.org/quic/hexagon_nn/nnlib ☆58 Updated 2 years ago
- code reading for tvm ☆76 Updated 4 years ago
- symmetric int8 gemm ☆66 Updated 5 years ago
- VeriSilicon Tensor Interface Module ☆246 Updated 3 weeks ago
- benchmark for embedded-ai deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc. ☆202 Updated 4 years ago
- how to design cpu gemm on x86 with avx256, that can beat openblas. ☆73 Updated 6 years ago
- ☆38 Updated last year
- a tensor computing compiler based tile programming for gpu, cpu or tpu ☆45 Updated last week
- Arm NEON documentation and the meaning of its instructions ☆247 Updated 6 years ago
- row-major matmul optimization ☆701 Updated 5 months ago
- examples for tvm schedule API ☆101 Updated 2 years ago
- heterogeneity-aware-lowering-and-optimization ☆257 Updated 2 years ago
- Arm neon optimization practice ☆394 Updated 5 years ago
- A hands-on tutorial on the core principles of TVM ☆64 Updated 5 years ago
- FlagTree is a unified compiler supporting multiple AI chip backends for custom Deep Learning operations, which is forked from triton-lang… ☆211 Updated this week
- ☆120 Updated last year
- Tencent NCNN with added CUDA support ☆71 Updated 5 years ago
- A stub opencl library that dynamically dlopen/dlsyms opencl implementations at runtime based on environment variables. Will be useful when… ☆74 Updated last year
- ppl.cv is a high-performance image processing library of openPPL supporting various platforms. ☆515 Updated last year
- Common libraries for PPL projects ☆31 Updated 11 months ago
- ☆158 Updated last year
- Six major CUDA parallel computing patterns: code and notes ☆61 Updated 5 years ago
- BLISlab: A Sandbox for Optimizing GEMM ☆555 Updated 4 years ago
- ☆19 Updated last year
- Efficient operation implementation based on the Cambricon Machine Learning Unit (MLU). ☆150 Updated 2 weeks ago
- CUDA Matrix Multiplication Optimization ☆256 Updated last year