the original reference implementation of a specified llama.cpp backend for Qualcomm Hexagon NPU on Android phone, https://github.com/ggml-org/llama.cpp/pull/12326. not maintained since Jul 15 2025
☆38Jul 14, 2025Updated 7 months ago
Alternatives and similar repositories for ggml-hexagon
Users that are interested in ggml-hexagon are comparing it to the libraries listed below
Sorting:
- ☆11Feb 7, 2026Updated last month
- Inference deployment of the llama3☆11Apr 21, 2024Updated last year
- LLM inference in C/C++☆48Feb 27, 2026Updated last week
- ☆10Jul 18, 2024Updated last year
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆90Feb 14, 2026Updated 3 weeks ago
- workbench for learning and practicing on-device AI technology in real scenario with online-TV on Android phone, powered by ggml(llama.cpp…☆187Jun 12, 2025Updated 8 months ago
- ggml学习笔记,ggml是一个机器学习的推理框架☆18Mar 24, 2024Updated last year
- 瑞芯微芯片的rknn推理框架部署(yolo模型)☆13Jul 17, 2025Updated 7 months ago
- RKNN-YOLOV5-BatchInference-MultiThreadingYOLOV5多张图片多线程C++推理☆22Nov 6, 2023Updated 2 years ago
- Sophgo AI chips driver and runtime library.☆24Feb 5, 2026Updated last month
- A one-page-only CGraph-API-liked DAG project.☆25Feb 11, 2025Updated last year
- RKNN模型推理部署模板☆24Aug 11, 2023Updated 2 years ago
- ☆27Mar 17, 2025Updated 11 months ago
- ☆28Jun 30, 2025Updated 8 months ago
- NanoTrack(@HonglinChu), C++ TensorRT deployment. MAX 250 FPS!☆28Nov 6, 2023Updated 2 years ago
- LLM inference in C/C++☆26Jan 27, 2026Updated last month
- ☆39Feb 12, 2026Updated 3 weeks ago
- yolov8 旋转目标检测部署,瑞芯微RKNN芯片部署、地平线Horizon芯片部署、TensorRT部署☆28Jun 4, 2024Updated last year
- C++ implementations for various tokenizers (sentencepiece, tiktoken etc).☆49Feb 23, 2026Updated 2 weeks ago
- ☆33Jul 23, 2024Updated last year
- Port of Funasr's Paraformer model in C/C++☆39Jun 19, 2024Updated last year
- 本项目是一个通过文字生成图片的项目,基于开源模型Stable Diffusion V1.5生成可以在手机的CPU和NPU上运行的模型,包括其配套的模型运行框架。☆235Mar 29, 2024Updated last year
- stable diffusion using mnn☆66Sep 28, 2023Updated 2 years ago
- This project is intended to build and deploy an SNPE model on Qualcomm Devices, which are having unsupported layers which are not part of…☆10Oct 4, 2021Updated 4 years ago
- ☆19Apr 14, 2025Updated 10 months ago
- Large Language Model Onnx Inference Framework☆35Nov 25, 2025Updated 3 months ago
- PyTorch for RISC-V Architecture on OpenEuler 24.03☆13Jun 27, 2024Updated last year
- 使用ONNXRuntime部署一种用于边缘检测的轻量级密集卷积神经网络LDC,包含C++和Python两个版本的程序☆11Apr 24, 2023Updated 2 years ago
- This python script can help you to detect what object is in moving.☆12Nov 28, 2018Updated 7 years ago
- PyTorch Quantization Aware Training(QAT,量化感知训练)☆42Oct 13, 2023Updated 2 years ago
- 基于yolov5的C++单目摄像头测距☆40Nov 2, 2023Updated 2 years ago
- FastRPC is Qualcomm's userspace library that facilitates efficient remote procedure calls between the CPU and DSP for high-performance co…☆76Updated this week
- Implementation of yolo v10 in c++ std 17 over opencv and onnxruntime☆90Sep 28, 2024Updated last year
- YOLOv12 TensorRT 端到端模型加速推理和INT8量化实现☆13Mar 5, 2025Updated last year
- A fourier-based audio-synthesiser wrote in MATLAB as a university project.☆12Jan 19, 2019Updated 7 years ago
- SHU 上海大学抢课助手(2024 新系统)☆12Dec 24, 2024Updated last year
- Guide to deploying deep-learning inference networks and deep vision primitives on SOPHON TPU.☆19Nov 14, 2025Updated 3 months ago
- 重构nerf代码,更加容易读懂☆12Mar 26, 2023Updated 2 years ago
- Use Maplibre Map as a base layer & map styles and Graphhooper Map route navigation data for polyline, duration and time.☆17Jun 16, 2025Updated 8 months ago