Tengine gemm tutorial, step by step
☆13Mar 12, 2021Updated 5 years ago
Alternatives and similar repositories for Tengine_gemm_tutorial
Users that are interested in Tengine_gemm_tutorial are comparing it to the libraries listed below.
- row-major matmul optimization☆713Feb 24, 2026Updated last month
- Modified version based on RetinaFace and CenterFace; the framework depends on PyTorch.☆31Jul 23, 2020Updated 5 years ago
- symmetric int8 gemm☆67Jun 7, 2020Updated 5 years ago
- ☆22May 15, 2021Updated 4 years ago
- dabnn is an accelerated binary neural networks inference framework for mobile platform☆778Nov 12, 2019Updated 6 years ago
- Generate a quantization parameter file for ncnn framework int8 inference☆518Jul 29, 2020Updated 5 years ago
- ☆2,006Jul 29, 2023Updated 2 years ago
- tools for MegaFace evaluation, e.g. plotting evaluation results☆21Jul 24, 2018Updated 7 years ago
- Bolt is a deep learning library with high performance and heterogeneous flexibility.☆957Apr 11, 2025Updated last year
- ☆17Sep 2, 2020Updated 5 years ago
- Winograd minimal convolution algorithm generator for convolutional neural networks.☆627Feb 9, 2026Updated 2 months ago
- Arm neon optimization practice☆392Dec 22, 2020Updated 5 years ago
- Tengine example for running on NNIE devices☆31May 11, 2020Updated 5 years ago
- Tengine is a lite, high performance, modular inference engine for embedded devices☆4,515Mar 6, 2025Updated last year
- Toybrick RKNN multithreaded C demo framework☆21Jun 11, 2020Updated 5 years ago
- DDK for Rockchip NPU☆69Dec 29, 2020Updated 5 years ago
- PLCT Lab 2019 Open Day materials (OpenDay-2019)☆11Dec 20, 2019Updated 6 years ago
- mxnet version batch hard triplet loss☆13Aug 30, 2018Updated 7 years ago
- arm-neon☆93Aug 2, 2024Updated last year
- Simulate quantization and quantization aware training for MXNet-Gluon models.☆44Apr 17, 2020Updated 5 years ago
- ☆42Jun 25, 2020Updated 5 years ago
- ☆12May 20, 2020Updated 5 years ago
- A simple tool to encrypt caffe model to deploy☆13Oct 25, 2018Updated 7 years ago
- Yet another Polyhedra Compiler for DeepLearning☆19Apr 14, 2023Updated 2 years ago
- Top-1 Acc=61.0% on ImageNet, without any sacrifice compared with SqueezeNet v1.1.☆22Jun 30, 2017Updated 8 years ago
- Mobile YOLOv3 object detector(person and face)☆10Jun 22, 2022Updated 3 years ago
- RetinaFace detector with C++☆396Jun 19, 2019Updated 6 years ago
- Face recognition model for mobile devices; same computational cost as mobilefacenet, but over 2% better on MegaFace☆232Apr 17, 2020Updated 5 years ago
- Everything in Torch Fx☆343Jun 7, 2024Updated last year
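- AutoKernel is an easy-to-use, low-barrier automatic operator optimization tool that improves the deployment efficiency of deep learning algorithms.☆747Sep 23, 2022Updated 3 years ago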
- Empirical Study of Recent Face Alignment Methods☆13May 19, 2017Updated 8 years ago
- Caffe implementation of Dynamic Network Surgery and Incremental Network Quantization☆15Dec 13, 2017Updated 8 years ago
- ☆12Aug 10, 2022Updated 3 years ago
- A primitive library for neural network☆1,368Nov 24, 2024Updated last year
- Facial landmarks training by mxnet(gluon). Use for face alignment and so on.☆40Mar 19, 2018Updated 8 years ago
- A Caffe implementation of EAST text detector☆17Mar 8, 2026Updated last month
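- Multi-target face tracking using MTCNN and O-Net tracking plus optical-flow tracking; single-target optical-flow face tracking takes about 0.5 ms☆120Oct 11, 2019Updated 6 years ago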
- detect human body from depth image☆16Jan 20, 2021Updated 5 years ago
- A demo using tensorRT on NVIDIA Jetson TX2 accelerating the Caffe model of AlexNet.☆28Jul 8, 2018Updated 7 years ago