ling0322 / libllmLinks
Efficient inference of large language models.
☆150Updated 3 months ago
Alternatives and similar repositories for libllm
Users that are interested in libllm are comparing it to the libraries listed below
Sorting:
- ncnn和pnnx格式编辑器☆137Updated last year
- ☆125Updated 2 years ago
- A repo for llm on ncnn☆178Updated 2 weeks ago
- Detect CPU features with single-file☆441Updated last week
- Tiny C++ LLM inference implementation from scratch☆98Updated last month
- A converter for llama2.c legacy models to ncnn models.☆79Updated 2 years ago
- Inference TinyLlama models on ncnn☆24Updated 2 years ago
- llm deploy project based onnx.☆49Updated last year
- a simple general program language☆99Updated this week
- Infere RWKV on NCNN☆49Updated last year
- GPT2⚡NCNN⚡中文对话⚡x86⚡Android☆82Updated 3 years ago
- ☆34Updated last year
- ☆33Updated last year
- 分层解耦的深度学 习推理引擎☆79Updated 11 months ago
- 将MNN拆解的简易前向推理框架(for study!)☆23Updated 4 years ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆90Updated last month
- Make a minimal OpenCV runable on any where, WIP☆85Updated 3 years ago
- A Deeplearn Model to rec table in photo with ncnn. 一个深度学习模型用于检测图片中的表格 画像内のテーブルを検出するためのディープラーニング モデル☆20Updated 10 months ago
- MegCC是一个运行时超轻量,高效,移植简单的深度学习模型编译器☆489Updated last year
- 关于自建AI推理引擎的手册,从0开始你需要知道的所有事情☆273Updated 3 years ago
- stable diffusion using mnn☆67Updated 2 years ago
- Explore LLM model deployment based on AXera's AI chips☆136Updated this week
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆43Updated 2 years ago
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Updated 2 years ago
- ☆84Updated 2 years ago
- ggml学习笔记,ggml是一个机器学习的推理框架☆18Updated last year
- ☆43Updated 3 years ago
- High Performan Ai Model Web Server. Mainly support computer vision model. Quickly establish your own ai-model server. https://github.com/…☆45Updated 8 months ago
- This is a demo how to write a high performance convolution run on apple silicon☆57Updated 3 years ago
- qwen2 and llama3 cpp implementation☆49Updated last year