A repo for llm on ncnn
☆219Apr 3, 2026Updated last month
Alternatives and similar repositories for ncnn_llm
Users that are interested in ncnn_llm are comparing it to the libraries listed below.
- A deep learning model for detecting tables in photos with ncnn.☆20Mar 2, 2025Updated last year
- Using ncnn to test the inference performance of neural networks☆38Jan 18, 2026Updated 3 months ago
- ☆10Jul 18, 2024Updated last year
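- High-performance, high-accuracy license plate recognition (LPR) for mainland China, Hong Kong and Macau, Taiwan, and South Korea plates; open source (ncnn port)☆44Nov 5, 2025Updated 6 months ago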
- UIE (Universal Information Extraction) inference with ncnn☆15Sep 22, 2024Updated last year
- A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.☆28Jan 4, 2026Updated 4 months ago
- ncnn android piper the fast and local neural text-to-speech engine☆62Jan 14, 2026Updated 3 months ago
- ☆14Nov 3, 2025Updated 6 months ago
- Comparison of LLM API performance metrics: an in-depth analysis of key metrics such as TTFT and TPS☆20Sep 12, 2024Updated last year
- A Rust reimplementation of the deep learning inference frameworks from https://github.com/zjhellofss/KuiperInfer and https://github.com/zjhellofss/kuiperdatawhale.☆17Apr 9, 2024Updated 2 years ago
- NVIDIA TensorRT Hackathon 2023 semifinal topic: building and optimizing Tongyi Qianwen Qwen-7B with TensorRT-LLM☆43Oct 20, 2023Updated 2 years ago
- ncnn export & infer mobileclip☆21Aug 18, 2025Updated 8 months ago
- ☆22Mar 5, 2024Updated 2 years ago
- Try to export the ONNX QDQ model that conforms to the AXERA NPU quantization specification. Currently, only w8a8 is supported.☆11Sep 10, 2024Updated last year
- Yolov12 model supports android deployment.☆147Jun 12, 2025Updated 10 months ago
- ncnn implementation of Z-Image image generator☆407Apr 15, 2026Updated 3 weeks ago
- [Deprecated] reflink for Windows☆28Sep 18, 2025Updated 7 months ago
- ncnn version of CodeFormer☆109Mar 9, 2023Updated 3 years ago
- A Gomoku (five-in-a-row) AI project implemented in PyTorch☆39Mar 16, 2026Updated last month
- ncnn android yolov8 realtime detection, segmentation, pose estimation, classification and obb☆197Jan 14, 2026Updated 3 months ago
- The repository supports TensorRT, QNN platform inference, 2D obstacle detection yolo series (yolov5, yolov8, yolo11, yolox), semantic seg…☆20May 6, 2025Updated last year
- This repository implements the YOLOv9 model on Jetson Orin Nano☆19Aug 28, 2024Updated last year
- Use ncnn model of yolo11 without magic operation☆21Dec 6, 2024Updated last year
- This code is for converting COCO json annotations to YOLO txt format (which both are common in object detection projects).☆10Feb 19, 2024Updated 2 years ago
- ExNorVPN free☆12Jun 6, 2020Updated 5 years ago
- Whisper in TensorRT-LLM☆17Sep 21, 2023Updated 2 years ago
- YOLOv5 series model supports the latest TensorRT10.☆17Jul 24, 2024Updated last year
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- High-performance RMSNorm implementation using SM core storage (registers and shared memory)☆30Jan 22, 2026Updated 3 months ago
- ncnn is a high-performance neural network inference framework optimized for the mobile platform☆14May 20, 2022Updated 3 years ago
- Ooura's General Purpose FFT (Fast Fourier/Cosine/Sine Transform) Package☆14Aug 21, 2023Updated 2 years ago
- ☆24Apr 5, 2022Updated 4 years ago
- ☆16Jan 30, 2020Updated 6 years ago
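- yolov8s-pose inference using ncnn!☆44Apr 27, 2023Updated 3 years ago
- CLIP inference implemented in C++. The model has minor modifications; the changes and the model export code can be found in the README, and all model files, including the AX650 models, are in Releases. Newly added support for ChineseCLIP.☆31Jun 19, 2025Updated 10 months ago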
- A converter for llama2.c legacy models to ncnn models.☆79Dec 17, 2023Updated 2 years ago
- Android paddleocr demo, inference with ncnn☆210Jul 23, 2024Updated last year
- Chip pin defect detection based on yolov8_obb, accelerated with TensorRT.☆23Aug 2, 2024Updated last year
- mnn yolo demos.☆86Oct 9, 2024Updated last year