KerfuffleV2 / llm-samplersLinks
A collection of LLM token samplers in Rust
☆18Updated last year
Alternatives and similar repositories for llm-samplers
Users that are interested in llm-samplers are comparing it to the libraries listed below
Sorting:
- High-level, optionally asynchronous Rust bindings to llama.cpp☆226Updated last year
- ☆58Updated 2 years ago
- A Fish Speech implementation in Rust, with Candle.rs☆94Updated 2 months ago
- Low rank adaptation (LoRA) for Candle.☆152Updated 3 months ago
- A relatively basic implementation of RWKV in Rust written by someone with very little math and ML knowledge. Supports 32, 8 and 4 bit eva…☆93Updated last year
- Rust client for the huggingface hub aiming for minimal subset of features over `huggingface-hub` python package☆219Updated last month
- A collection of optimisers for use with candle☆37Updated last week
- Rust library for whisper.cpp compatible Mel spectrograms☆72Updated 2 months ago
- Rust+OpenCL+AVX2 implementation of LLaMA inference code☆549Updated last year
- Efficent platform for inference and serving local LLMs including an OpenAI compatible API server.☆409Updated this week
- LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed!☆108Updated 2 years ago
- Rust implementation of Huggingface transformers pipelines using onnxruntime backend with bindings to C# and C.☆39Updated 2 years ago
- ☆329Updated this week
- ONNX neural network inference engine☆222Updated this week
- Inference Llama 2 in one file of pure Rust 🦀☆233Updated last year
- Rust port of sentence-transformers (https://github.com/UKPLab/sentence-transformers)☆119Updated 10 months ago
- ☆32Updated 2 years ago
- A Rust implementation of OpenAI's Whisper model using the burn framework☆319Updated last year
- LLama.cpp rust bindings☆396Updated last year
- ☆20Updated 10 months ago
- A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust☆80Updated last year
- Rust bindings for OpenNMT/CTranslate2☆36Updated 2 weeks ago
- ☆23Updated 3 months ago
- Llama2 LLM ported to Rust burn☆280Updated last year
- Rust wrapper for Microsoft's ONNX Runtime (version 1.8)☆305Updated last year
- Rust-tokenizer offers high-performance tokenizers for modern language models, including WordPiece, Byte-Pair Encoding (BPE) and Unigram (…☆323Updated last year
- Port of Andrej Karpathy's minbpe to Rust☆25Updated last year
- Rust bindings to https://github.com/k2-fsa/sherpa-onnx☆199Updated 2 months ago
- Implementation of the RWKV language model in pure WebGPU/Rust.☆314Updated 3 weeks ago
- An extension library to Candle that provides PyTorch functions not currently available in Candle☆40Updated last year