ling0322 / libonmi
Efficient inference of large language models.
☆147Updated 6 months ago
Alternatives and similar repositories for libonmi
Users that are interested in libonmi are comparing it to the libraries listed below
- ☆123Updated last year
- Editor for the ncnn and pnnx formats☆133Updated 7 months ago
- Detect CPU features with a single file☆393Updated 3 weeks ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆67Updated last week
- A converter for llama2.c legacy models to ncnn models.☆87Updated last year
- RWKV inference on NCNN☆48Updated 9 months ago
- ☆32Updated 10 months ago
- Make a minimal OpenCV runnable anywhere, WIP☆82Updated 2 years ago
- Serving Inside Pytorch☆160Updated 3 weeks ago
- ggml study notes; ggml is an inference framework for machine learning☆15Updated last year
- Inference TinyLlama models on ncnn☆24Updated last year
- A layered, decoupled deep learning inference engine☆73Updated 3 months ago
- ☆31Updated 8 months ago
- A demo of how to write a high-performance convolution that runs on Apple silicon☆54Updated 3 years ago
- UIE (Universal Information Extraction) inference with ncnn☆12Updated 8 months ago
- cpp syntactic sugar☆8Updated 3 weeks ago
- A programming language for AI infrastructure☆88Updated this week
- stable diffusion using mnn☆68Updated last year
- Tiny C++11 GPT-2 inference implementation from scratch☆63Updated last week
- NeRF in NCNN with c++ & vulkan☆67Updated last year
- FlagTree is a unified compiler for multiple AI chips, which is forked from triton-lang/triton.☆24Updated this week
- LLM deployment project based on ONNX.☆37Updated 7 months ago
- MegCC is a deep learning model compiler with an ultra-lightweight runtime that is efficient and easy to port☆485Updated 7 months ago
- OneFlow->ONNX☆43Updated 2 years ago
- Triton documentation in Simplified Chinese (Triton 中文文档)☆71Updated last month
- Run ChatGLM2-6B on BM1684X☆49Updated last year
- A simple forward-inference framework made by taking MNN apart (for study!)☆22Updated 4 years ago
- A small language model that supports Chinese-language scenarios: llama2.c-zh☆147Updated last year
- Benchmark your NCNN models on 3DS (or crash)☆10Updated last year
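- A text-to-image project: starting from the open-source Stable Diffusion V1.5 model, it generates models that can run on a phone's CPU and NPU, and includes the accompanying model runtime framework.☆196Updated last year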