KerfuffleV2 / llm-samplersLinks

A collection of LLM token samplers in Rust

☆18

Alternatives and similar repositories for llm-samplers

Users that are interested in llm-samplers are comparing it to the libraries listed below

Sorting:

edgenai / llama_cpp-rs
High-level, optionally asynchronous Rust bindings to llama.cpp
☆232Updated last year
Narsil / smelte-rs
☆58Updated 2 years ago
Noeda / rllama
Rust+OpenCL+AVX2 implementation of LLaMA inference code
☆550Updated last year
EricLBuehler / candle-lora
Low rank adaptation (LoRA) for Candle.
☆166Updated 6 months ago
EndlessReform / fish-speech.rs
A Fish Speech implementation in Rust, with Candle.rs
☆101Updated 5 months ago
KerfuffleV2 / smolrsrwkv
A relatively basic implementation of RWKV in Rust written by someone with very little math and ML knowledge. Supports 32, 8 and 4 bit eva…
☆94Updated 2 years ago
wavey-ai / mel-spec
Rust library for whisper.cpp compatible Mel spectrograms
☆79Updated 6 months ago
utilityai / llama-cpp-rs
☆392Updated last week
mdrokz / rust-llama.cpp
LLama.cpp rust bindings
☆408Updated last year
KGrewal1 / candle-optimisers
A collection of optimisers for use with candle
☆43Updated 3 months ago
robertknight / rten
ONNX neural network inference engine
☆257Updated this week
Gadersd / whisper-burn
A Rust implementation of OpenAI's Whisper model using the burn framework
☆329Updated last year
Gadersd / stable-diffusion-burn
Stable Diffusion v1.4 ported to Rust's burn framework
☆342Updated last year
EricLBuehler / candle-vllm
Efficent platform for inference and serving local LLMs including an OpenAI compatible API server.
☆514Updated this week
huggingface / hf-hub
Rust client for the huggingface hub aiming for minimal subset of features over `huggingface-hub` python package
☆242Updated last month
LaurentMazare / glim
☆19Updated last year
npc-engine / edge-transformers
Rust implementation of Huggingface transformers pipelines using onnxruntime backend with bindings to C# and C.
☆39Updated 2 years ago
gaxler / llama2.rs
Inference Llama 2 in one file of pure Rust 🦀
☆233Updated 2 years ago
KerfuffleV2 / ggml-sys-bleedingedge
Bleeding edge low level Rust binding for GGML
☆16Updated last year
EricLBuehler / safetensors_explorer
CLI utility to inspect and explore .safetensors and .gguf files
☆34Updated 2 weeks ago
coreylowman / llama-dfdx
LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed!
☆110Updated 2 years ago
LaurentMazare / diffusers-rs
An implementation of the diffusers api in Rust
☆580Updated last year
pixelspark / poly
A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust
☆79Updated last year
mokeyish / candle-ext
An extension library to Candle that provides PyTorch functions not currently available in Candle
☆40Updated last year
ToluClassics / candle-tutorial
Tutorial for Porting PyTorch Transformer Models to Candle (Rust)
☆327Updated last year
zackshen / gguf
a GGUF file parser
☆16Updated 2 months ago
Narsil / bindgen_cuda
☆26Updated 7 months ago
Gadersd / llama2-burn
Llama2 LLM ported to Rust burn
☆278Updated last year
tracel-ai / burn-lm
Democratizing large model inference and training on any device.
☆167Updated 2 weeks ago
nkeenan38 / voice_activity_detector
A Voice Activity Detector rust library using the Silero VAD model.
☆56Updated 3 months ago