Efficient inference of large language models.
☆150Sep 28, 2025Updated 6 months ago
Alternatives and similar repositories for libllm
Users that are interested in libllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- UIE(Universal Information Extraction) infer by ncnn☆15Sep 22, 2024Updated last year
- Detect CPU features with single-file☆453Updated this week
- ☆16Mar 24, 2025Updated last year
- Benchmark your NCNN models on 3DS(or crash)☆10Apr 15, 2024Updated last year
- ☆33Jul 23, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Infere RWKV on NCNN☆49Sep 3, 2024Updated last year
- 分层解耦的深度学习推理引擎☆79Feb 17, 2025Updated last year
- ppstructure deploy by ncnn☆36Jul 16, 2024Updated last year
- A converter for llama2.c legacy models to ncnn models.☆79Dec 17, 2023Updated 2 years ago
- ☆23Jan 3, 2024Updated 2 years ago
- ☆125Dec 15, 2023Updated 2 years ago
- Visual comparison for YUV and JPG/PNG images.☆21May 16, 2024Updated last year
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- Whisper in TensorRT-LLM☆17Sep 21, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- 基于NCNN框架实现车道线检测(C/C++)☆24Apr 21, 2025Updated 11 months ago
- NeRF in NCNN with c++ & vulkan☆68Jun 18, 2023Updated 2 years ago
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Aug 25, 2023Updated 2 years ago
- 测试桌面端ncnn c++算法☆17Jun 15, 2025Updated 9 months ago
- ncnn和pnnx格式编辑器☆138Oct 7, 2024Updated last year
- Handy tools & graphics API abstraction for blazing fast prototyping☆10Jan 17, 2024Updated 2 years ago
- linux bsp app & sample for axpi (ax620a)☆36Jun 21, 2023Updated 2 years ago
- a simple programming language under development☆11Dec 3, 2023Updated 2 years ago
- Inference TinyLlama models on ncnn☆24Aug 15, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆91Feb 14, 2026Updated last month
- 使用 cutlass 仓库在 ada 架构上实现 fp8 的 flash attention☆82Aug 12, 2024Updated last year
- An implementation of memcpy for amd64 with clang/gcc☆14Feb 7, 2022Updated 4 years ago
- A Deeplearn Model to rec table in photo with ncnn. 一个深度学习模型用于检测图片中的表格 画像内のテーブルを検出するためのディープラーニング モデル☆20Mar 2, 2025Updated last year
- llm deploy project based onnx.☆50Oct 9, 2024Updated last year
- row-major matmul optimization☆713Feb 24, 2026Updated last month
- A tool which profiles Vulkan devices to find their peak capacities☆168Feb 27, 2026Updated last month
- Self-trained Large Language Models based on Meta LLaMa☆29Aug 11, 2023Updated 2 years ago
- llm-export can export llm model to onnx.☆347Oct 24, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).☆250Mar 15, 2024Updated 2 years ago
- PyTorch Neural Network eXchange☆702Apr 3, 2026Updated last week
- DragGan in NCNN with c++☆52Oct 5, 2023Updated 2 years ago
- an example of segment-anything infer by ncnn☆124May 5, 2023Updated 2 years ago
- Practical, Easy-to-copy CMake examples☆316Sep 30, 2025Updated 6 months ago
- DontStarve mods☆11May 27, 2019Updated 6 years ago
- SAM and lama inpaint,包含QT的GUI交互界面,实现了交互式可实时显示结果的画点、画框进行SAM,然后通过进行Inpaint,具体操作看readme里的视频。☆53Jan 30, 2024Updated 2 years ago