waau / qualcomm-nnlibLinks

Qualcomm Hexagon NN Offload Framework

☆43

Alternatives and similar repositories for qualcomm-nnlib

Users that are interested in qualcomm-nnlib are comparing it to the libraries listed below

Sorting:

XiaoMi / nnlib
Fork of https://source.codeaurora.org/quic/hexagon_nn/nnlib
☆58Updated 2 years ago
alibaba / heterogeneity-aware-lowering-and-optimization
heterogeneity-aware-lowering-and-optimization
☆256Updated last year
tlc-pack / tophub
tophub autotvm log collections
☆69Updated 2 years ago
FrozenGene / tvm-tutorial
TVM tutorial
☆66Updated 6 years ago
vinx13 / tvm-cuda-int8-benchmark
Benchmark of TVM quantized model on CUDA
☆111Updated 5 years ago
tpoisonooo / chgemm
symmetric int8 gemm
☆67Updated 5 years ago
AI-performance / embedded-ai.bench
benchmark for embededded-ai deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.
☆204Updated 4 years ago
merrymercy / tvm-mali
Optimizing Mobile Deep Learning on ARM GPU with TVM
☆181Updated 7 years ago
tlc-pack / TLCBench
Benchmark scripts for TVM
☆74Updated 3 years ago
BBuf / how-to-optimize-gemm
☆98Updated 4 years ago
xuqiantong / CUDA-Winograd
Fast CUDA Kernels for ResNet Inference.
☆180Updated 6 years ago
tobegit3hub / tftvm
TensorFlow and TVM integration
☆36Updated 5 years ago
carlushuang / cpu_gemm_opt
how to design cpu gemm on x86 with avx256, that can beat openblas.
☆71Updated 6 years ago
StrongSpoon / tvm.schedule
examples for tvm schedule API
☆101Updated 2 years ago
FrozenGene / tflite
TFLite python API package for parsing TFLite model
☆12Updated 5 years ago
masahi / torchscript-to-tvm
☆68Updated 2 years ago
pigirons / conv3x3_m1
This is a demo how to write a high performance convolution run on apple silicon
☆56Updated 3 years ago
whitelok / tvm-lesson
动手学习TVM核心原理教程
☆63Updated 4 years ago
mit-han-lab / inter-operator-scheduler
[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration
☆200Updated 3 years ago
OpenPPL / CuAssembler
An unofficial cuda assembler, for all generations of SASS, hopefully ：）
☆83Updated 2 years ago
MegEngine / mperf
mperf是一个面向移动/嵌入式平台的算子性能调优工具箱
☆190Updated 2 years ago
XiuYuLi / flexible-gemm
flexible-gemm conv of deepcore
☆17Updated 5 years ago
VeriSilicon / TIM-VX
VeriSilicon Tensor Interface Module
☆238Updated 2 weeks ago
CharlieCurry / tvm-learning
TVM learning and research
☆13Updated 4 years ago
XiuYuLi / deepcore_source_code
Subpart source code of of deepcore v0.7
☆27Updated 5 years ago
OpenPPL / ppl.kernel.cpu
☆18Updated last year
NVIDIA / sampleQAT
Inference of quantization aware trained networks using TensorRT
☆83Updated 2 years ago
andersy005 / tvm-in-action
TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together
☆64Updated 7 years ago
xingyul / sparse-winograd-cnn
Efficient Sparse-Winograd Convolutional Neural Networks (ICLR 2018)
☆193Updated 6 years ago
BBuf / ArmNeonOptimization
arm-neon
☆92Updated last year