ling0322 / omni.rs
Efficient inference of large language models.
☆148Updated this week
Alternatives and similar repositories for omni.rs
Users who are interested in omni.rs are comparing it to the libraries listed below
- ☆124Updated last year
- Editor for the ncnn and pnnx formats☆133Updated 8 months ago
- Detect CPU features with a single file☆403Updated 3 weeks ago
- A converter for llama2.c legacy models to ncnn models.☆87Updated last year
- ☆32Updated 10 months ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆68Updated last week
- Inference TinyLlama models on ncnn☆24Updated last year
- Make a minimal OpenCV runnable anywhere, WIP☆82Updated 2 years ago
- Inference of RWKV on NCNN☆48Updated 9 months ago
- stable diffusion using mnn☆68Updated last year
- ☆83Updated 2 years ago
- A simple forward-inference framework made by taking MNN apart (for study!)☆22Updated 4 years ago
- ☆249Updated last year
- Inference RWKV with multiple supported backends.☆48Updated this week
- NeRF in NCNN with c++ & vulkan☆67Updated last year
- ☆31Updated 8 months ago
- A programming language for AI infrastructure☆88Updated last week
- Qwen2 and Llama 3 C++ implementation☆44Updated last year
- ☆19Updated last week
- DragGan in NCNN with c++☆50Updated last year
- MegCC is a deep learning model compiler with an ultra-lightweight runtime, high efficiency, and easy portability☆485Updated 7 months ago
- GPT2⚡NCNN⚡Chinese dialogue⚡x86⚡Android☆79Updated 3 years ago
- LLM deployment project based on ONNX.☆37Updated 7 months ago
- Serving Inside Pytorch☆160Updated 3 weeks ago
- Prebuilt packages for cross-compiling RISC-V☆18Updated 3 years ago
- A demo of how to write a high-performance convolution that runs on Apple silicon☆54Updated 3 years ago
- FlagTree is a unified compiler for multiple AI chips, which is forked from triton-lang/triton.☆24Updated this week
- mperf is an operator performance tuning toolbox for mobile and embedded platforms☆187Updated last year
- A llama model inference framework implemented in CUDA C++☆57Updated 6 months ago
- An easy way to run, test, benchmark and tune OpenCL kernel files☆23Updated last year