KerfuffleV2 / smolrsrwkvLinks
A relatively basic implementation of RWKV in Rust written by someone with very little math and ML knowledge. Supports 32, 8 and 4 bit evaluation. It can also directly load PyTorch RWKV models.
☆94Updated 2 years ago
Alternatives and similar repositories for smolrsrwkv
Users that are interested in smolrsrwkv are comparing it to the libraries listed below
Sorting:
- ☆32Updated 2 years ago
- GGML bindings that aim to be idiomatic Rust rather than directly corresponding to the C/C++ interface☆19Updated 2 years ago
- Bleeding edge low level Rust binding for GGML☆16Updated last year
- Rust+OpenCL+AVX2 implementation of LLaMA inference code☆550Updated last year
- ☆58Updated 2 years ago
- High-level, optionally asynchronous Rust bindings to llama.cpp☆232Updated last year
- A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust☆79Updated last year
- tinygrad port of the RWKV large language model.☆44Updated 8 months ago
- Inference of Mamba models in pure C☆192Updated last year
- Implementation of the RWKV language model in pure WebGPU/Rust.☆327Updated 3 weeks ago
- LLaMA from First Principles☆51Updated 2 years ago
- ☆19Updated last year
- Inference Llama 2 in one file of pure Rust 🦀☆233Updated 2 years ago
- Low rank adaptation (LoRA) for Candle.☆166Updated 6 months ago
- A highly customizable, full scale web backend for web-rwkv, built on axum with websocket protocol.☆27Updated last year
- WebGPU LLM inference tuned by hand☆150Updated 2 years ago
- Implementing the BitNet model in Rust☆41Updated last year
- LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed!☆110Updated 2 years ago
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust☆39Updated 2 years ago
- Tiny Autograd engine written in Rust☆61Updated last year
- ☆28Updated 2 years ago
- Rust implementation of Huggingface transformers pipelines using onnxruntime backend with bindings to C# and C.☆39Updated 2 years ago
- A torchless, c++ rwkv implementation using 8bit quantization, written in cuda/hip/vulkan for maximum compatibility and minimum dependenci…☆313Updated last year
- A neural network inference library, written in Rust.☆69Updated last year
- Work in progress rust bindings to ggml☆12Updated 2 years ago
- Port of Microsoft's BioGPT in C/C++ using ggml☆85Updated last year
- A Rust implementation of OpenAI's Whisper model using the burn framework☆329Updated last year
- Rust library for whisper.cpp compatible Mel spectrograms☆79Updated 6 months ago
- RWKV models and examples powered by candle.☆19Updated 8 months ago
- OpenAI compatible API for serving LLAMA-2 model☆218Updated 2 years ago