ling0322 / omni.rsLinks

Efficient inference of large language models.

☆148

Alternatives and similar repositories for omni.rs

Users that are interested in omni.rs are comparing it to the libraries listed below

Sorting:

daquexian / faster-rwkv
☆124Updated last year
scarsty / ncnn-editor
ncnn和pnnx格式编辑器
☆133Updated 8 months ago
nihui / ruapu
Detect CPU features with single-file
☆403Updated 3 weeks ago
lrw04 / llama2.c-to-ncnn
A converter for llama2.c legacy models to ncnn models.
☆87Updated last year
EdVince / llm-cpp
☆32Updated 10 months ago
MollySophia / rwkv-qualcomm
Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK
☆68Updated last week
lrw04 / tinyllamas-ncnn
Inference TinyLlama models on ncnn
☆24Updated last year
lucasjinreal / simpleocv
Make a minimal OpenCV runable on any where, WIP
☆82Updated 2 years ago
MollySophia / rwkv-ncnn
Infere RWKV on NCNN
☆48Updated 9 months ago
wangzhaode / mnn-stable-diffusion
stable diffusion using mnn
☆68Updated last year
EdVince / diffusers-ncnn
☆83Updated 2 years ago
corleonechensiyu / tinyCNN
将MNN拆解的简易前向推理框架(for study!)
☆22Updated 4 years ago
MegEngine / MegPeak
☆249Updated last year
MollySophia / rwkv-mobile
Inference RWKV with multiple supported backends.
☆48Updated this week
EdVince / NeRF-NCNN
NeRF in NCNN with c++ & vulkan
☆67Updated last year
StudyingLover / ggml-tutorial
☆31Updated 8 months ago
prajna-lang / prajna
a program language for AI infrastructure
☆88Updated last week
yvonwin / qwen2.cpp
qwen2 and llama3 cpp implementation
☆44Updated last year
nihui / ncnn-android-ppocrv5
☆19Updated last week
EdVince / DragGan-NCNN
DragGan in NCNN with c++
☆50Updated last year
MegEngine / MegCC
MegCC是一个运行时超轻量，高效，移植简单的深度学习模型编译器
☆485Updated 7 months ago
EdVince / GPT2-ChineseChat-NCNN
GPT2⚡NCNN⚡中文对话⚡x86⚡Android
☆79Updated 3 years ago
wangzhaode / onnx-llm
llm deploy project based onnx.
☆37Updated 7 months ago
torchpipe / torchpipe
Serving Inside Pytorch
☆160Updated 3 weeks ago
nihui / riscv-v-toolchain
prebuild package for cross compiling riscv
☆18Updated 3 years ago
pigirons / conv3x3_m1
This is a demo how to write a high performance convolution run on apple silicon
☆54Updated 3 years ago
FlagTree / flagtree
FlagTree is a unified compiler for multiple AI chips, which is forked from triton-lang/triton.
☆24Updated this week
MegEngine / mperf
mperf是一个面向移动/嵌入式平台的算子性能调优工具箱
☆187Updated last year
caiwanxianhust / FasterLLaMA
使用 CUDA C++ 实现的 llama 模型推理框架
☆57Updated 6 months ago
jinmingyi1998 / opencl_kernels
An easy way to run, test, benchmark and tune OpenCL kernel files
☆23Updated last year