lovelyzzkei / QNN-Android-ServerLinks
Let's use Qualcomm NPU in Android
☆14Updated 8 months ago
Alternatives and similar repositories for QNN-Android-Server
Users that are interested in QNN-Android-Server are comparing it to the libraries listed below
Sorting:
- Demonstration of combine YOLO and depth estimation on Android device.☆58Updated 2 weeks ago
- Try to export the ONNX QDQ model that conforms to the AXERA NPU quantization specification. Currently, only w8a8 is supported.☆11Updated last year
- PyTorch Quantization Aware Training(QAT,量化感知训练)☆38Updated 2 years ago
- Run Chinese MobileBert model on SNPE.☆15Updated 2 years ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆85Updated this week
- A yolov7-tiny model inference applied on qualcomm snpe for pedestrian detection with embedded system.☆13Updated last year
- A simple tutorial of SNPE.☆178Updated 2 years ago
- QAI AppBuilder is designed to help developers easily execute models on WoS and Linux platforms. It encapsulates the Qualcomm® AI Runtime …☆79Updated last week
- Llama3 Streaming Chat Sample☆22Updated last year
- An onnx-based quantitation tool.☆71Updated last year
- A Toolkit to Help Optimize Large Onnx Model☆161Updated last year
- EasyNN是一个面向教学而开发的神经网络推理框架,旨在让大家0基础也能自主完成推理框架编写!☆33Updated last year
- This is a repository to practice multi-thread programming in C++☆26Updated last year
- base quantization methods including: QAT, PTQ, per_channel, per_tensor, dorefa, lsq, adaround, omse, Histogram, bias_correction.etc☆50Updated 2 years ago
- A set of examples around MegEngine☆31Updated last year
- the original reference implementation of a specified llama.cpp backend for Qualcomm Hexagon NPU on Android phone, https://github.com/ggml…☆35Updated 3 months ago
- snpe tutorial☆10Updated last year
- ☆164Updated 4 months ago
- Quantize yolov5 using pytorch_quantization.🚀🚀🚀☆14Updated 2 years ago
- Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration☆71Updated 5 months ago
- DETR tensor去除推理过程无用辅助头+fp16部署再次加速+解决转tensorrt 输出全为0问题的新方法。☆12Updated last year
- ☆16Updated last year
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX. And Export the ONNX model for AXera's NPU☆12Updated 2 months ago
- 该代码与B站上的视频 https://www.bilibili.com/video/BV18L41197Uz/?spm_id_from=333.788&vd_source=eefa4b6e337f16d87d87c2c357db8ca7 相关联。☆70Updated 2 years ago
- 本仓库在OpenVINO推理框架下部署Nanodet检测算法,并重写预处理和后处理部分,具有超高性能!让你在Intel CPU平台上的检测速度起飞! 并基于NNCF和PPQ工具将模型量化(PTQ)至int8精度,推理速度更快!☆15Updated 2 years ago
- ☆26Updated 2 years ago
- SAM and lama inpaint,包含QT的GUI交互界面,实现了交互式可实时显示结果的画点、画框进行SAM,然后通过进行Inpaint,具体操作看readme里的视频。☆50Updated last year
- llm deploy project based onnx.☆45Updated last year
- Speed up image preprocess with cuda when handle image or tensorrt inference☆79Updated 2 weeks ago
- An object tracking project with YOLOv5-v5.0 and Deepsort, speed up by C++ and TensorRT.☆17Updated this week