Efficient inference of large language models.
☆151Sep 28, 2025Updated 7 months ago
Alternatives and similar repositories for libllm
Users who are interested in libllm are comparing it to the libraries listed below.
- UIE (Universal Information Extraction) inference with ncnn☆15Sep 22, 2024Updated last year
- Detect CPU features with single-file☆456Apr 6, 2026Updated last month
- ☆16Mar 24, 2025Updated last year
- Benchmark your NCNN models on 3DS (or crash)☆10Apr 15, 2024Updated 2 years ago
- RWKV inference on NCNN☆49Sep 3, 2024Updated last year
- ppstructure deployment with ncnn☆37Jul 16, 2024Updated last year
- ☆23Jan 3, 2024Updated 2 years ago
- A converter for llama2.c legacy models to ncnn models.☆79Dec 17, 2023Updated 2 years ago
- ☆125Dec 15, 2023Updated 2 years ago
- Visual comparison for YUV and JPG/PNG images.☆21May 16, 2024Updated 2 years ago
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- Whisper in TensorRT-LLM☆17Sep 21, 2023Updated 2 years ago
- NeRF in NCNN with c++ & vulkan☆68Jun 18, 2023Updated 2 years ago
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Aug 25, 2023Updated 2 years ago
- Test ncnn C++ algorithms on desktop☆17Jun 15, 2025Updated 11 months ago
- Editor for ncnn and pnnx formats☆147Apr 21, 2026Updated last month
- Handy tools & graphics API abstraction for blazing fast prototyping☆10Jan 17, 2024Updated 2 years ago
- linux bsp app & sample for axpi (ax620a)☆36Jun 21, 2023Updated 2 years ago
- a simple programming language under development☆11Dec 3, 2023Updated 2 years ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆91Updated this week
- FP8 flash attention implemented on the Ada architecture using the cutlass repository☆81Aug 12, 2024Updated last year
- An implementation of memcpy for amd64 with clang/gcc☆15Feb 7, 2022Updated 4 years ago
- A deep learning model for detecting tables in photos with ncnn.☆20Mar 2, 2025Updated last year
- LLM deployment project based on ONNX.☆49Oct 9, 2024Updated last year
- row-major matmul optimization☆725May 14, 2026Updated last week
- A tool which profiles Vulkan devices to find their peak capacities☆169Apr 14, 2026Updated last month
- Self-trained Large Language Models based on Meta LLaMa☆29Aug 11, 2023Updated 2 years ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆63May 6, 2023Updated 3 years ago
- llm-export can export LLM models to ONNX.☆352May 8, 2026Updated 2 weeks ago
- Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).☆252Mar 15, 2024Updated 2 years ago
- PyTorch Neural Network eXchange☆707May 6, 2026Updated 2 weeks ago
- DragGan in NCNN with c++☆52Oct 5, 2023Updated 2 years ago
- An example of segment-anything inference with ncnn☆124May 5, 2023Updated 3 years ago
- Practical, Easy-to-copy CMake examples☆317Sep 30, 2025Updated 7 months ago
- SAM and LaMa inpainting with a Qt GUI: interactively draw points or boxes to run SAM with results shown in real time, then inpaint. See the video in the README for detailed usage.☆54Jan 30, 2024Updated 2 years ago
- Tencent NCNN with added CUDA support☆71Jan 18, 2021Updated 5 years ago
- ☆59Nov 21, 2024Updated last year
- ☆28Aug 10, 2023Updated 2 years ago
- ☆42Jun 25, 2020Updated 5 years ago