XiaoMi / nnlibLinks

Fork of https://source.codeaurora.org/quic/hexagon_nn/nnlib

☆58

Alternatives and similar repositories for nnlib

Users that are interested in nnlib are comparing it to the libraries listed below

Sorting:

waau / qualcomm-nnlib
Qualcomm Hexagon NN Offload Framework
☆43Updated 4 years ago
merrymercy / tvm-mali
Optimizing Mobile Deep Learning on ARM GPU with TVM
☆181Updated 7 years ago
tpoisonooo / chgemm
symmetric int8 gemm
☆67Updated 5 years ago
AI-performance / embedded-ai.bench
benchmark for embededded-ai deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.
☆204Updated 4 years ago
carlushuang / cpu_gemm_opt
how to design cpu gemm on x86 with avx256, that can beat openblas.
☆71Updated 6 years ago
XiuYuLi / flexible-gemm
flexible-gemm conv of deepcore
☆17Updated 5 years ago
alibaba / heterogeneity-aware-lowering-and-optimization
heterogeneity-aware-lowering-and-optimization
☆256Updated last year
vinx13 / tvm-cuda-int8-benchmark
Benchmark of TVM quantized model on CUDA
☆111Updated 5 years ago
VeriSilicon / TIM-VX
VeriSilicon Tensor Interface Module
☆237Updated last week
MegEngine / mperf
mperf是一个面向移动/嵌入式平台的算子性能调优工具箱
☆190Updated 2 years ago
FrozenGene / tvm-tutorial
TVM tutorial
☆66Updated 6 years ago
lyuchuny3 / Tengine_gemm_tutorial
Tengine gemm tutorial, step by step
☆13Updated 4 years ago
XiuYuLi / deepcore_source_code
Subpart source code of of deepcore v0.7
☆27Updated 5 years ago
OpenPPL / ppl.common
Common libraries for PPL projects
☆29Updated 7 months ago
BBuf / ArmNeonOptimization
arm-neon
☆92Updated last year
xuqiantong / CUDA-Winograd
Fast CUDA Kernels for ResNet Inference.
☆180Updated 6 years ago
strin / gemm-android
tutorial to optimize GEMM performance on android
☆51Updated 9 years ago
XiaoMi / mobile-ai-bench
Benchmarking Neural Network Inference on Mobile Devices
☆383Updated 2 years ago
VeriSilicon / acuity-models
Acuity Model Zoo
☆147Updated last month
tobegit3hub / tftvm
TensorFlow and TVM integration
☆36Updated 5 years ago
tlc-pack / tophub
tophub autotvm log collections
☆69Updated 2 years ago
FrozenGene / tflite
TFLite python API package for parsing TFLite model
☆12Updated 5 years ago
krrishnarraj / libopencl-stub
A stub opecl library that dynamically dlopen/dlsyms opencl implementations at runtime based on environment variables. Will be useful when…
☆74Updated last year
BBuf / how-to-optimize-gemm
☆98Updated 4 years ago
hey-yahei / Quantization.MXNet
Simulate quantization and quantization aware training for MXNet-Gluon models.
☆45Updated 5 years ago
whitelok / tvm-lesson
动手学习TVM核心原理教程
☆63Updated 4 years ago
Cambricon / mlu-ops
Efficient operation implementation based on the Cambricon Machine Learning Unit (MLU) .
☆134Updated last week
xiaoweiChen / Heterogeneous-Computing-with-OpenCL-2.0
作为对《Heterogeneous Computing with OpenCL 2.0》英文版的中文翻译。
☆140Updated 5 years ago
CAS-CLab / CNN-Inference-Engine-Quick-View
A quick view of high-performance convolution neural networks (CNNs) inference engines on mobile devices.
☆151Updated 3 years ago
atanmarko / ncnn-with-cuda
Tencent NCNN with added CUDA support
☆69Updated 4 years ago