KerfuffleV2 / smolrsrwkvLinks

A relatively basic implementation of RWKV in Rust written by someone with very little math and ML knowledge. Supports 32, 8 and 4 bit evaluation. It can also directly load PyTorch RWKV models.

☆94

Alternatives and similar repositories for smolrsrwkv

Users that are interested in smolrsrwkv are comparing it to the libraries listed below

Sorting:

mrsteyk / rwkvk-rs
☆32Updated 2 years ago
Noeda / rllama
Rust+OpenCL+AVX2 implementation of LLaMA inference code
☆551Updated last year
KerfuffleV2 / rusty-ggml
GGML bindings that aim to be idiomatic Rust rather than directly corresponding to the C/C++ interface
☆19Updated 2 years ago
KerfuffleV2 / ggml-sys-bleedingedge
Bleeding edge low level Rust binding for GGML
☆16Updated last year
Narsil / smelte-rs
☆58Updated 2 years ago
edgenai / llama_cpp-rs
High-level, optionally asynchronous Rust bindings to llama.cpp
☆235Updated last year
pixelspark / poly
A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust
☆79Updated last year
leo-du / llama2.rs
Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust
☆39Updated 2 years ago
EndlessReform / fish-speech.rs
A Fish Speech implementation in Rust, with Candle.rs
☆105Updated 6 months ago
kroggen / mamba.c
Inference of Mamba models in pure C
☆194Updated last year
Mathemmagician / rustygrad
Tiny Autograd engine written in Rust
☆61Updated last year
LaurentMazare / glim
☆19Updated last year
tomsanbear / bitnet-rs
Implementing the BitNet model in Rust
☆42Updated last year
cryscan / web-rwkv
Implementation of the RWKV language model in pure WebGPU/Rust.
☆331Updated last month
chelsea0x3b / llama-dfdx
LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed!
☆110Updated 2 years ago
KarelPeeters / Kyanite
A neural network inference library, written in Rust.
☆70Updated last year
Narsil / ggblas
☆28Updated 2 years ago
wozeparrot / tinyrwkv
tinygrad port of the RWKV large language model.
☆45Updated 8 months ago
LaurentMazare / ug
Experimental compiler for deep learning models
☆71Updated 2 months ago
gaxler / llama2.rs
Inference Llama 2 in one file of pure Rust 🦀
☆235Updated 2 years ago
LaurentMazare / mamba.rs
☆135Updated last year
npc-engine / edge-transformers
Rust implementation of Huggingface transformers pipelines using onnxruntime backend with bindings to C# and C.
☆39Updated 2 years ago
EricLBuehler / candle-lora
Low rank adaptation (LoRA) for Candle.
☆168Updated 7 months ago
lachlansneff / nxml
LLaMA from First Principles
☆51Updated 2 years ago
gnp / minbpe-rs
Port of Andrej Karpathy's minbpe to Rust
☆30Updated last year
oxideai / diffusers-burn
A diffusers API in Burn (Rust)
☆22Updated last year
KGrewal1 / candle-optimisers
A collection of optimisers for use with candle
☆44Updated this week
AmineDiro / cria
OpenAI compatible API for serving LLAMA-2 model
☆218Updated 2 years ago
wavey-ai / mel-spec
Rust library for whisper.cpp compatible Mel spectrograms
☆80Updated last week
kayvr / token-hawk
WebGPU LLM inference tuned by hand
☆151Updated 2 years ago