Efficient inference of large language models.
☆150Sep 28, 2025Updated 7 months ago
Alternatives and similar repositories for libllm
Users that are interested in libllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- UIE(Universal Information Extraction) infer by ncnn☆15Sep 22, 2024Updated last year
- Detect CPU features with single-file☆456Apr 6, 2026Updated 3 weeks ago
- ☆16Mar 24, 2025Updated last year
- Benchmark your NCNN models on 3DS(or crash)☆10Apr 15, 2024Updated 2 years ago
- ☆33Jul 23, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Infere RWKV on NCNN☆49Sep 3, 2024Updated last year
- ppstructure deploy by ncnn☆37Jul 16, 2024Updated last year
- ☆23Jan 3, 2024Updated 2 years ago
- A converter for llama2.c legacy models to ncnn models.☆79Dec 17, 2023Updated 2 years ago
- ☆125Dec 15, 2023Updated 2 years ago
- Visual comparison for YUV and JPG/PNG images.☆21May 16, 2024Updated last year
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- Whisper in TensorRT-LLM☆17Sep 21, 2023Updated 2 years ago
- NeRF in NCNN with c++ & vulkan☆68Jun 18, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Aug 25, 2023Updated 2 years ago
- 测试桌面端ncnn c++算法☆17Jun 15, 2025Updated 10 months ago
- ncnn和pnnx格式编辑器☆146Apr 21, 2026Updated last week
- linux bsp app & sample for axpi (ax620a)☆36Jun 21, 2023Updated 2 years ago
- a simple programming language under development☆11Dec 3, 2023Updated 2 years ago
- Inference TinyLlama models on ncnn☆24Aug 15, 2023Updated 2 years ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆91Feb 14, 2026Updated 2 months ago
- 使用 cutlass 仓库在 ada 架构上实现 fp8 的 flash attention☆81Aug 12, 2024Updated last year
- An implementation of memcpy for amd64 with clang/gcc☆14Feb 7, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A Deeplearn Model to rec table in photo with ncnn. 一个深度学习模型用于检测图片中的表格 画像内のテーブルを検出するためのディープラーニング モデル☆20Mar 2, 2025Updated last year
- llm deploy project based onnx.☆49Oct 9, 2024Updated last year
- row-major matmul optimization☆721Feb 24, 2026Updated 2 months ago
- A tool which profiles Vulkan devices to find their peak capacities☆169Apr 14, 2026Updated 2 weeks ago
- Self-trained Large Language Models based on Meta LLaMa☆29Aug 11, 2023Updated 2 years ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆63May 6, 2023Updated 2 years ago
- llm-export can export llm model to onnx.☆350Oct 24, 2025Updated 6 months ago
- Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).☆250Mar 15, 2024Updated 2 years ago
- PyTorch Neural Network eXchange☆704Apr 14, 2026Updated 2 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- DragGan in NCNN with c++☆52Oct 5, 2023Updated 2 years ago
- an example of segment-anything infer by ncnn☆124May 5, 2023Updated 2 years ago
- Practical, Easy-to-copy CMake examples☆317Sep 30, 2025Updated 7 months ago
- DontStarve mods☆11May 27, 2019Updated 6 years ago
- SAM and lama inpaint,包含QT的GUI交互界面,实现了交互式可实时显示结果的画点、画框进行SAM,然后通过进行Inpaint,具体操作看readme里的视频。☆54Jan 30, 2024Updated 2 years ago
- Tencent NCNN with added CUDA support☆71Jan 18, 2021Updated 5 years ago
- ☆59Nov 21, 2024Updated last year