Efficient inference of large language models.
☆149Sep 28, 2025Updated 5 months ago
Alternatives and similar repositories for libllm
Users that are interested in libllm are comparing it to the libraries listed below
Sorting:
- UIE(Universal Information Extraction) infer by ncnn☆15Sep 22, 2024Updated last year
- Detect CPU features with single-file☆452Mar 5, 2026Updated 2 weeks ago
- ☆16Mar 24, 2025Updated 11 months ago
- Benchmark your NCNN models on 3DS(or crash)☆10Apr 15, 2024Updated last year
- ☆32Jul 23, 2024Updated last year
- Infere RWKV on NCNN☆49Sep 3, 2024Updated last year
- 分层解耦的深度学习推理引擎☆78Feb 17, 2025Updated last year
- A converter for llama2.c legacy models to ncnn models.☆79Dec 17, 2023Updated 2 years ago
- ☆23Jan 3, 2024Updated 2 years ago
- Visual comparison for YUV and JPG/PNG images.☆20May 16, 2024Updated last year
- ppstructure deploy by ncnn☆36Jul 16, 2024Updated last year
- ☆124Dec 15, 2023Updated 2 years ago
- 基于NCNN框架实现车道线检测(C/C++)☆24Apr 21, 2025Updated 11 months ago
- NeRF in NCNN with c++ & vulkan☆67Jun 18, 2023Updated 2 years ago
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Aug 25, 2023Updated 2 years ago
- 测试桌面端ncnn c++算法☆17Jun 15, 2025Updated 9 months ago
- Whisper in TensorRT-LLM☆17Sep 21, 2023Updated 2 years ago
- a simple programming language under development☆11Dec 3, 2023Updated 2 years ago
- Inference TinyLlama models on ncnn☆24Aug 15, 2023Updated 2 years ago
- ncnn和pnnx格式编辑器☆137Oct 7, 2024Updated last year
- 使用 cutlass 仓库在 ada 架构上实现 fp8 的 flash attention☆81Aug 12, 2024Updated last year
- An implementation of memcpy for amd64 with clang/gcc☆15Feb 7, 2022Updated 4 years ago
- row-major matmul optimization☆707Feb 24, 2026Updated 3 weeks ago
- A tool which profiles Vulkan devices to find their peak capacities☆163Feb 27, 2026Updated 3 weeks ago
- Self-trained Large Language Models based on Meta LLaMa☆29Aug 11, 2023Updated 2 years ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆63May 6, 2023Updated 2 years ago
- Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).☆250Mar 15, 2024Updated 2 years ago
- PyTorch Neural Network eXchange☆700Updated this week
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆90Feb 14, 2026Updated last month
- an example of segment-anything infer by ncnn☆123May 5, 2023Updated 2 years ago
- Practical, Easy-to-copy CMake examples☆316Sep 30, 2025Updated 5 months ago
- DontStarve mods☆11May 27, 2019Updated 6 years ago
- llm-export can export llm model to onnx.☆344Oct 24, 2025Updated 4 months ago
- Tencent NCNN with added CUDA support☆71Jan 18, 2021Updated 5 years ago
- ☆28Aug 10, 2023Updated 2 years ago
- ☆60Nov 21, 2024Updated last year
- ☆42Jun 25, 2020Updated 5 years ago
- 一些有用的功能☆47Mar 13, 2026Updated last week
- [MobiCom 24] Memory-adaptive DNN inference on edge☆58Jan 22, 2025Updated last year