☆63Dec 16, 2025Updated 3 months ago
Alternatives and similar repositories for llama.cpp-npu
Users that are interested in llama.cpp-npu are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Self-implemented NN operators for Qualcomm's Hexagon NPU☆53Sep 30, 2025Updated 5 months ago
- High-speed and easy-use LLM serving framework for local deployment☆145Aug 7, 2025Updated 7 months ago
- YOLOv5在高通AI Engine Direct环境下进行QNN量化,CPU推理的项目☆15Sep 10, 2024Updated last year
- Run Chinese MobileBert model on SNPE.☆14May 19, 2023Updated 2 years ago
- Implementation of Generalized Cross Correlation with Phase Transform (GCC-PHAT) library in C/C++.☆20Jul 8, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ToyLLM: Learning LLM from Scratch☆25Mar 16, 2026Updated last week
- Home page for Microsoft Phi-Ground tech-report☆23Sep 8, 2025Updated 6 months ago
- Demonstrate Yolov9 model with Qualcomm Hexagon NPU and DirectML☆12Nov 27, 2024Updated last year
- ☆28Dec 2, 2024Updated last year
- ☆11Sep 20, 2024Updated last year
- [DATE'2025, TCAD'2025] Terafly : A Multi-Node FPGA Based Accelerator Design for Efficient Cooperative Inference in LLMs☆30Nov 13, 2025Updated 4 months ago
- 2019年全国大学生电子设计大赛G题双路语音调频接收机的FPGA全实现☆18Apr 15, 2020Updated 5 years ago
- ☆16Mar 4, 2026Updated 3 weeks ago
- this is just a copy of https://code.msdn.microsoft.com/windowsdesktop/DirectCompute-Graphics-425de5a8☆11Nov 1, 2017Updated 8 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.