☆550Apr 28, 2026Updated this week
Alternatives and similar repositories for llama-cpp-rs
Users that are interested in llama-cpp-rs are comparing it to the libraries listed below.
- High-level, optionally asynchronous Rust bindings to llama.cpp☆246Jun 5, 2024Updated last year
- llama.cpp Rust bindings☆422Jun 27, 2024Updated last year
- Rust library for generating vector embeddings, reranking locally!☆877Updated this week
- The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibes☆248Aug 6, 2025Updated 8 months ago
- Efficient platform for inference and serving local LLMs including an OpenAI compatible API server.☆652Updated this week
- Fast, flexible LLM inference☆7,074Apr 15, 2026Updated 2 weeks ago
- Fast ML inference & training for ONNX models in Rust☆2,215Updated this week
- ☆13Nov 4, 2023Updated 2 years ago
- A simple and easy-to-use library for interacting with the Ollama API.☆1,027Updated this week
- Rust bindings to https://github.com/k2-fsa/sherpa-onnx☆306Mar 8, 2026Updated last month
- Simple, efficient and cross-platform TFIDF-based text summarizer in Rust☆13Apr 12, 2024Updated 2 years ago
- Use piper TTS models in Rust☆53Mar 25, 2026Updated last month
- Rust bindings to https://github.com/ggerganov/whisper.cpp☆937Jul 30, 2025Updated 9 months ago
- 🦜️🔗LangChain for Rust, the easiest way to write LLM-based programs in Rust☆1,296Updated this week
- Minimalist ML framework for Rust☆20,139Updated this week
- handle gguf files☆13Aug 14, 2025Updated 8 months ago
- A Pure Rust based LLM, VLM, VLA, TTS, OCR Inference Engine, powering by Candle & Rust. Alternate to your llama.cpp but much more simpler …☆370Updated this week
- Yet another `llama.cpp` Rust wrapper☆12Apr 23, 2026Updated last week
- Unofficial Rust bindings to Apple's mlx framework☆314Apr 18, 2026Updated 2 weeks ago
- Instant, controllable, local pre-trained AI models in Rust☆2,185Updated this week
- Rust inference implementation of Kokoro TTS☆31Jan 21, 2026Updated 3 months ago
- A cross-platform inference engine for neural TTS models.☆74Nov 25, 2024Updated last year
- Burn is a next generation tensor library and Deep Learning Framework that doesn't compromise on flexibility, efficiency and portability.☆14,981Updated this week
- A whisper <lib|cli|server> written in rust☆20Updated this week
- pyannote audio diarization in rust☆114Sep 7, 2025Updated 7 months ago
- Fast, streaming indexing, query, and agentic LLM applications in Rust☆692Updated this week
- Speech detection using silero vad in Rust☆32Dec 16, 2024Updated last year
- Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)☆3,057Jan 13, 2026Updated 3 months ago
- Open-source LLM/VLM load balancer and serving platform for self-hosting LLMs (and VLMs) at scale 🏓🦙 Alternative to projects like llm-d,…☆1,540Updated this week
- The easiest & fastest way to run customized and fine-tuned LLMs locally or on the edge☆1,626Feb 8, 2026Updated 2 months ago
- Rust client for the huggingface hub aiming for minimal subset of features over `huggingface-hub` python package☆291Apr 25, 2026Updated last week
- The official Rust SDK for the Model Context Protocol☆3,349Apr 23, 2026Updated last week
- A cross-platform browser ML framework.☆756Apr 2, 2026Updated last month
- A wrapper around the llama-cpp library for rust, including new Sampler API from llama-cpp.☆18Updated this week
- a fast cross platform AI inference engine 🤖 using Rust 🦀 and WebGPU 🎮☆466Jan 4, 2025Updated last year
- Tiny, no-nonsense, self-contained, Tensorflow and ONNX inference☆2,887Updated this week
- Rust bindings for the C++ api of PyTorch.☆5,369Mar 26, 2026Updated last month
- [Unmaintained, see README] An ecosystem of Rust libraries for working with large language models☆6,152Jun 24, 2024Updated last year
- `llm-chain` is a powerful rust crate for building chains in large language models allowing you to summarise text and complete complex tas…☆1,598Oct 31, 2024Updated last year