lovelyzzkei / QNN-Android-Server
Let's use Qualcomm NPU in Android
☆18Feb 18, 2025Updated 11 months ago
Alternatives and similar repositories for QNN-Android-Server
Users that are interested in QNN-Android-Server are comparing it to the libraries listed below
- codebase for "MELTing Point: Mobile Evaluation of Language Transformers"☆18Jul 19, 2024Updated last year
- ☆120Updated this week
- This project is intended to build and deploy an SNPE model on Qualcomm devices when the model has unsupported layers that are not part of…☆10Oct 4, 2021Updated 4 years ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆90Feb 5, 2026Updated last week
- Deploys LDC, a lightweight dense convolutional neural network for edge detection, with ONNXRuntime; includes both C++ and Python versions of the program☆11Apr 24, 2023Updated 2 years ago
- Official codes for "DeepSVC: Deep Scalable Video Coding for Both Machine and Human Vision"☆12Nov 29, 2023Updated 2 years ago
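- Approaches and optimizations that use parallel processing when an algorithm is too slow to handle a real-time video stream (app level).☆11Jun 5, 2021Updated 4 years ago
- Sample that converts the MobileSAM encoder/decoder to ONNX and runs inference☆12Apr 11, 2024Updated last year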
- TDNN (time delay neural network) TensorFlow implementation☆10Mar 6, 2020Updated 5 years ago
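- Deployment version of CenterNet3D, easy to port to different platforms (ONNX, TensorRT, RKNN, Horizon).☆13May 24, 2024Updated last year
- DETR tensor: removes the auxiliary heads that are unused during inference, adds FP16 deployment for a further speedup, and offers a new method for fixing the all-zero output problem after converting to TensorRT.☆12Jan 9, 2024Updated 2 years ago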
- Try to export the ONNX QDQ model that conforms to the AXERA NPU quantization specification. Currently, only w8a8 is supported.☆11Sep 10, 2024Updated last year
- 🎉My Collections of CUDA Kernels~☆11Jun 25, 2024Updated last year
- A lightweight distributed RPC framework powered by pure C language and based on ZeroMQ and pbc.☆10Mar 4, 2014Updated 11 years ago
- YOLOv12 TensorRT end-to-end model inference acceleration and INT8 quantization implementation☆13Mar 5, 2025Updated 11 months ago
- Collection of Machine Learning examples using MLEK CMSIS-pack.☆10Dec 8, 2025Updated 2 months ago
- trt-hackathon-2022 third-prize solution☆10Mar 6, 2023Updated 2 years ago
- ☆10Jul 18, 2024Updated last year
- A Fourier-based audio synthesiser written in MATLAB as a university project.☆12Jan 19, 2019Updated 7 years ago
- C++ code for deploying FastSAM on RKNN☆14May 30, 2024Updated last year
- Refactored NeRF code that is easier to read☆13Mar 26, 2023Updated 2 years ago
- Inference deployment of Llama 3☆11Apr 21, 2024Updated last year
- The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) a…☆372Jan 27, 2026Updated 2 weeks ago
- Converts CLIP models to ONNX☆11Jan 17, 2023Updated 3 years ago
- YUV player☆13Jan 9, 2022Updated 4 years ago
- snpe tutorial☆10Dec 25, 2023Updated 2 years ago
- The purpose of this repository is to create a DeepStream/Triton-Server sample application that utilizes yolov7, yolov7-qat, yolov9 models…☆12Apr 1, 2024Updated last year
- High-performance RMSNorm implementation using SM core storage (registers and shared memory)☆26Jan 22, 2026Updated 3 weeks ago
- BERT TensorRT model acceleration and deployment☆10Apr 1, 2022Updated 3 years ago
- Flash Attention in ~100 lines of CUDA (forward pass only)☆11Jun 10, 2024Updated last year
- ☆13Oct 5, 2023Updated 2 years ago
- This repository implements the YOLOv9 model on Jetson Orin Nano☆12Aug 28, 2024Updated last year
- Dockerfiles for poetry/mlc-llm(rk3588)/...☆10Sep 13, 2023Updated 2 years ago
- Python ONNX inference sample for LDC: Lightweight Dense CNN for Edge Detection☆15May 6, 2023Updated 2 years ago
- Demonstrates the YOLOv9 model with Qualcomm Hexagon NPU and DirectML☆12Nov 27, 2024Updated last year
- Python scripts that perform open-vocabulary object detection using the YOLO-World model in ONNX and export the ONNX model for AXera's NPU☆12Aug 11, 2025Updated 6 months ago
- A PyTorch implementation for ELIC (CVPR 2022).☆10Mar 29, 2023Updated 2 years ago
- JAX bindings for the flash-attention3 kernels☆20Jan 2, 2026Updated last month
- paper-read-notes☆13Sep 26, 2024Updated last year