☆468Feb 26, 2026Updated this week
Alternatives and similar repositories for llama-cpp-rs
Users that are interested in llama-cpp-rs are comparing it to the libraries listed below
- High-level, optionally asynchronous Rust bindings to llama.cpp☆243Jun 5, 2024Updated last year
- LLama.cpp rust bindings☆414Jun 27, 2024Updated last year
- Rust library for vector embeddings and reranking.☆780Feb 23, 2026Updated last week
- The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibes☆244Aug 6, 2025Updated 6 months ago
- Fast ML inference & training for ONNX models in Rust☆2,021Updated this week
- Efficient platform for inference and serving local LLMs including an OpenAI compatible API server.☆603Updated this week
- Fast, flexible LLM inference☆6,623Updated this week
- Rust bindings to https://github.com/k2-fsa/sherpa-onnx☆297Nov 1, 2025Updated 4 months ago
- Rust bindings to https://github.com/ggerganov/whisper.cpp☆933Jul 30, 2025Updated 7 months ago
- A simple and easy-to-use library for interacting with the Ollama API.☆989Feb 23, 2026Updated last week
- ☆13Nov 4, 2023Updated 2 years ago
- Yet another `llama.cpp` Rust wrapper☆12Jun 19, 2024Updated last year
- handle gguf files☆12Aug 14, 2025Updated 6 months ago
- A whisper <lib|cli|server> written in Rust☆20Jan 3, 2026Updated last month
- Instant, controllable, local pre-trained AI models in Rust☆2,145Updated this week
- Use piper TTS models in Rust☆48Dec 17, 2024Updated last year
- Simple, efficient and cross-platform TFIDF-based text summarizer in Rust☆13Apr 12, 2024Updated last year
- Minimalist ML framework for Rust☆19,509Updated this week
- 🦜️🔗LangChain for Rust, the easiest way to write LLM-based programs in Rust☆1,236Feb 22, 2026Updated last week
- Rust client for the huggingface hub aiming for minimal subset of features over `huggingface-hub` python package☆265Feb 19, 2026Updated last week
- A cross-platform inference engine for neural TTS models.☆73Nov 25, 2024Updated last year
- Burn is a next generation tensor library and Deep Learning Framework that doesn't compromise on flexibility, efficiency and portability.☆14,473Updated this week
- The easiest & fastest way to run customized and fine-tuned LLMs locally or on the edge☆1,599Feb 8, 2026Updated 3 weeks ago
- A pure-Rust LLM, VLM, VLA, TTS, OCR inference engine, powered by Candle & Rust. An alternative to llama.cpp but much simpler …☆287Feb 24, 2026Updated last week
- Open-source LLM load balancer and serving platform for self-hosting LLMs at scale 🏓🦙 Alternative to projects like llm-d, Docker Model R…☆1,467Updated this week
- Rust inference implementation of Kokoro TTS☆29Jan 21, 2026Updated last month
- LLM inference in C/C++☆23Oct 4, 2024Updated last year
- Fast, streaming indexing, query, and agentic LLM applications in Rust☆667Feb 24, 2026Updated last week
- Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)☆3,042Jan 13, 2026Updated last month
- Speech detection using silero vad in Rust☆30Dec 16, 2024Updated last year
- A cross-platform browser ML framework.☆747Nov 23, 2024Updated last year
- pyannote audio diarization in Rust☆103Sep 7, 2025Updated 5 months ago
- Unofficial Rust bindings to Apple's mlx framework☆261Feb 21, 2026Updated last week
- Tiny, no-nonsense, self-contained, Tensorflow and ONNX inference☆2,798Updated this week
- A Voice Activity Detector Rust library using the Silero VAD model.☆62Aug 4, 2025Updated 6 months ago
- `llm-chain` is a powerful rust crate for building chains in large language models allowing you to summarise text and complete complex tas…☆1,591Oct 31, 2024Updated last year
- [Unmaintained, see README] An ecosystem of Rust libraries for working with large language models☆6,150Jun 24, 2024Updated last year
- Rust bindings for the C++ api of PyTorch.☆5,302Jan 22, 2026Updated last month
- ONNX neural network inference engine☆291Updated this week