lucasjinreal / Crane

A pure Rust LLM inference engine (supporting any LLM-based MLLM, such as Spark-TTS), powered by the Candle framework.

☆144

Alternatives and similar repositories for Crane

Users who are interested in Crane are comparing it to the libraries listed below.

EndlessReform / fish-speech.rs
A Fish Speech implementation in Rust, with Candle.rs
☆94 Updated 2 months ago
ShelbyJenkins / llm_client
The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibes
☆219 Updated last week
lucasjinreal / Kokoros
🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast, real-time TTS with the highest quality you have ever had.
☆572 Updated 3 weeks ago
utilityai / llama-cpp-rs
☆329 Updated this week
EricLBuehler / diffusion-rs
Blazingly fast inference of diffusion models.
☆111 Updated 4 months ago
thewh1teagle / sherpa-rs
Rust bindings to https://github.com/k2-fsa/sherpa-onnx
☆199 Updated 2 months ago
baehyunsol / ragit
Git-like RAG pipeline
☆237 Updated this week
EricLBuehler / candle-vllm
Efficient platform for inference and serving local LLMs, including an OpenAI-compatible API server.
☆409 Updated this week
adriancable / qwen3.c
Local Qwen3 LLM inference. One easy-to-understand file of C source with no dependencies.
☆97 Updated last month
Oxen-AI / GRPO-With-Cargo-Feedback
This repository has code for fine-tuning LLMs with GRPO, specifically for Rust programming, using cargo as feedback
☆100 Updated 4 months ago
jkawamoto / ctranslate2-rs
Rust bindings for OpenNMT/CTranslate2
☆33 Updated last week
guidance-ai / llgtrt
TensorRT-LLM server with Structured Outputs (JSON) built with Rust
☆57 Updated 3 months ago
Noveum / ai-gateway
Built for demanding AI workflows, this gateway offers low-latency, provider-agnostic access, ensuring your AI applications run smoothly a…
☆73 Updated 2 months ago
ljt019 / transformers
Transformers provides a simple, intuitive interface for Rust developers who want to work with Large Language Models locally, powered by t…
☆17 Updated 3 weeks ago
PlugOvr-ai / PlugOvr
AI Assistant
☆20 Updated 3 months ago
edgenai / llama_cpp-rs
High-level, optionally asynchronous Rust bindings to llama.cpp
☆226 Updated last year
pixelspark / poly
A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust
☆80 Updated last year
NimbleEdge / sparse_transformers
Sparse inference for transformer-based LLMs
☆196 Updated this week
inferx-net / inferx
InferX is an Inference Function as a Service Platform
☆119 Updated last week
nath1295 / MLX-Textgen
A Python package for serving LLMs on OpenAI-compatible API endpoints with prompt caching using MLX.
☆90 Updated last month
atoma-network / atoma-infer
Fast serverless LLM inference, in Rust.
☆88 Updated 5 months ago
ShelbyJenkins / candle_embed
A simple CUDA- or CPU-powered library for creating vector embeddings using Candle and models from Hugging Face
☆37 Updated last year
AmineDiro / cria
OpenAI-compatible API for serving the LLAMA-2 model
☆218 Updated last year
oxideai / mlx-rs
Unofficial Rust bindings to Apple's mlx framework
☆173 Updated this week
EricLBuehler / candle-lora
Low rank adaptation (LoRA) for Candle.
☆152 Updated 3 months ago
TesslateAI / TFrameX
☆152 Updated last week
google-ai-edge / LiteRT-LM
☆290 Updated this week
iluxu / llmbasedos
Minimal Linux OS with a Model Context Protocol (MCP) gateway to expose local capabilities to LLMs.
☆259 Updated last month
amrit110 / oli
A simple, fast, terminal-based AI coding assistant
☆172 Updated last week
JackMatthewRimmer / rust-rag-toolchain
Library for doing RAG
☆74 Updated last week