ling0322 / libllmLinks
Efficient inference of large language models.
☆149Updated 4 months ago
Alternatives and similar repositories for libllm
Users that are interested in libllm are comparing it to the libraries listed below
Sorting:
- ncnn和pnnx格式编辑器☆137Updated last year
- ☆125Updated 2 years ago
- Tiny C++ LLM inference implementation from scratch☆102Updated last week
- A repo for llm on ncnn☆189Updated last week
- Detect CPU features with single-file☆443Updated last week
- A converter for llama2.c legacy models to ncnn models.☆79Updated 2 years ago
- Inference TinyLlama models on ncnn☆24Updated 2 years ago
- llm deploy project based onnx.☆49Updated last year
- Make a minimal OpenCV runable on any where, WIP☆87Updated 3 years ago
- 关于自建AI推理引擎的手册,从0开始你需要知道的所有事情☆272Updated 3 years ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆90Updated this week
- ☆34Updated last year
- ☆33Updated last year
- Infere RWKV on NCNN☆49Updated last year
- MegCC是一个运行时超轻量,高效,移植简单的深度学习模型编译器☆488Updated last year
- a simple general program language☆100Updated last week
- 分层解耦的深度学习推理引擎☆79Updated 11 months ago
- ncnn implementation of Z-Image image generater☆94Updated this week
- ggml学习笔记,ggml 是一个机器学习的推理框架☆18Updated last year
- A Deeplearn Model to rec table in photo with ncnn. 一个深度学习模型用于检测图片中的表格 画像内のテーブルを検出するためのディープラーニング モデル☆20Updated 11 months ago
- GPT2⚡NCNN⚡中文对话⚡x86⚡Android☆82Updated 3 years ago
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Updated 2 years ago
- stable diffusion using mnn☆67Updated 2 years ago
- This is a demo how to write a high performance convolution run on apple silicon☆57Updated 4 years ago
- CPU inference for the DeepSeek family of large language models in C++☆315Updated 4 months ago
- Triton Documentation in Chinese Simplified / Triton 中文文档☆102Updated last month
- qwen2 and llama3 cpp implementation☆49Updated last year
- Header-only safetensors loader and saver in C++☆78Updated last month
- ☆85Updated 2 years ago
- A high performance, high expansion, easy to use framework for AI application. 为AI应用的开发者提供一套统一的高性能、易用的编程框架,快速基于AI全栈服务、开发跨端边云的AI行业应用,支持GPU,…☆160Updated last year