KerfuffleV2 / llm-samplersLinks
A collection of LLM token samplers in Rust
☆18Updated last year
Alternatives and similar repositories for llm-samplers
Users that are interested in llm-samplers are comparing it to the libraries listed below
Sorting:
- ☆58Updated 2 years ago
- An extension library to Candle that provides PyTorch functions not currently available in Candle☆40Updated last year
- A Fish Speech implementation in Rust, with Candle.rs☆92Updated last month
- A relatively basic implementation of RWKV in Rust written by someone with very little math and ML knowledge. Supports 32, 8 and 4 bit eva…☆93Updated last year
- ☆23Updated 3 months ago
- A Keras like abstraction layer on top of the Rust ML framework candle☆23Updated last year
- Rust implementation of Huggingface transformers pipelines using onnxruntime backend with bindings to C# and C.☆39Updated 2 years ago
- High-level, optionally asynchronous Rust bindings to llama.cpp☆222Updated last year
- ☆20Updated 9 months ago
- Rust library for whisper.cpp compatible Mel spectrograms☆72Updated 2 months ago
- ☆32Updated 2 years ago
- A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust☆80Updated last year
- A collection of optimisers for use with candle☆36Updated last month
- Low rank adaptation (LoRA) for Candle.☆151Updated 2 months ago
- Rust standalone inference of Namo-500M series models. Extremly tiny, runing VLM on CPU.☆24Updated 4 months ago
- Implementing the BitNet model in Rust☆37Updated last year
- Port of Andrej Karpathy's minbpe to Rust☆25Updated last year
- Rust client for the huggingface hub aiming for minimal subset of features over `huggingface-hub` python package☆214Updated 3 weeks ago
- ☆89Updated 6 months ago
- Experimental compiler for deep learning models☆68Updated last month
- A high-performance constrained decoding engine based on context free grammar in Rust☆54Updated last month
- Rust port of annoy (https://github.com/spotify/annoy)☆44Updated last month
- 8-bit floating point types for Rust☆47Updated 4 months ago
- A simple, CUDA or CPU powered, library for creating vector embeddings using Candle and models from Hugging Face☆37Updated last year
- LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed!☆107Updated last year
- a GGUF file parser☆12Updated 2 months ago
- GGML bindings that aim to be idiomatic Rust rather than directly corresponding to the C/C++ interface☆19Updated last year
- A minimal OpenCL, CUDA, Vulkan and host CPU array manipulation engine / framework.☆74Updated this week
- A Voice Activity Detector rust library using the Silero VAD model.☆44Updated 3 months ago
- Blazingly fast inference of diffusion models.☆111Updated 3 months ago