lovelyzzkei / QNN-Android-ServerLinks
Let's use Qualcomm NPU in Android
☆11Updated 6 months ago
Alternatives and similar repositories for QNN-Android-Server
Users that are interested in QNN-Android-Server are comparing it to the libraries listed below
Sorting:
- PyTorch Quantization Aware Training(QAT,量化感知训练)☆35Updated last year
- Demonstration of combine YOLO and depth estimation on Android device.☆57Updated 2 weeks ago
- A simple tutorial of SNPE.☆177Updated 2 years ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆79Updated last week
- ☆154Updated 2 months ago
- The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) a…☆270Updated 3 weeks ago
- Run Chinese MobileBert model on SNPE.☆15Updated 2 years ago
- Try to export the ONNX QDQ model that conforms to the AXERA NPU quantization specification. Currently, only w8a8 is supported.☆10Updated 11 months ago
- NCNN的代码学习,各种小Demo。☆116Updated last year
- base quantization methods including: QAT, PTQ, per_channel, per_tensor, dorefa, lsq, adaround, omse, Histogram, bias_correction.etc☆47Updated 2 years ago
- trt-hackathon-2022 三等奖方案☆10Updated 2 years ago
- A parser, editor and profiler tool for ONNX models.☆451Updated 3 weeks ago
- Llama3 Streaming Chat Sample☆22Updated last year
- PyTorch Neural Network eXchange☆614Updated last week
- EasyNN是一个面向教学而开发的神经网络推理框架,旨在让大家0基础也能自主完成推理框架编写!☆32Updated last year
- Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration☆68Updated 3 months ago
- YOLOv5 on Orin DLA☆211Updated last year
- nerf☆41Updated 3 years ago
- A Toolkit to Help Optimize Large Onnx Model☆158Updated last year
- Build shared libraries (`.so`) to use TF Lite C++ API in Android applications☆50Updated 2 years ago
- a simple pipline of int8 quantization based on tensorrt.☆66Updated 2 years ago
- 该代码与B站上的视频 https://www.bilibili.com/video/BV18L41197Uz/?spm_id_from=333.788&vd_source=eefa4b6e337f16d87d87c2c357db8ca7 相关联。☆70Updated last year
- TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.☆842Updated last week
- OpenPose uses Pytorch for static quantization, saving, and loading of models☆89Updated 4 years ago
- This is 8-bit quantization sample for yolov5. Both PTQ, QAT and Partial Quantization have been implemented, and present the results based…☆107Updated 3 years ago
- ☆333Updated last year
- Demonstration of running a native LLM on Android device.☆169Updated last week
- Quantization Aware Training☆80Updated last year
- Script to typecast ONNX model parameters from INT64 to INT32.☆109Updated last year
- A Toolkit to Help Optimize Onnx Model☆198Updated this week