A Pure Rust based LLM, VLM, VLA, TTS, OCR Inference Engine, powering by Candle & Rust. Alternate to your llama.cpp but much more simpler and cleaner..
☆401May 4, 2026Updated last month
Alternatives and similar repositories for Crane
Users that are interested in Crane are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Rust standalone inference of Namo-500M series models. Extremly tiny, runing VLM on CPU.☆24Mar 12, 2025Updated last year
- Kokoro TTS的Rust推理实现☆33Jun 1, 2026Updated last week
- 🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast, realtime TTS with high quality you ever have.☆781Jun 1, 2026Updated last week
- ☆24Jan 22, 2025Updated last year
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :)☆15May 29, 2026Updated last week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A Rust 🦀 port of the Hugging Face smolagents library.☆43Mar 26, 2025Updated last year
- Game servers running on Kubernetes☆12Apr 28, 2025Updated last year
- Run Orpheus 3B Locally with Gradio UI, Standalone App☆24Apr 1, 2025Updated last year
- Flash attention implementation Minimal CUDA implementation of Flash Attention with tiled computation and online softmax. Educational imp…☆21Dec 27, 2025Updated 5 months ago
- Fast, flexible LLM inference☆7,255Updated this week
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆26Mar 28, 2025Updated last year
- Rust bindings to https://github.com/k2-fsa/sherpa-onnx☆310Mar 8, 2026Updated 3 months ago
- ☆577Updated this week
- The Python Implementation of CRISP: Clustering Multi-Vector Representations for Denoising and Pruning☆27Jul 27, 2025Updated 10 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Graph model execution API for Candle☆18Jul 27, 2025Updated 10 months ago
- ☆51Feb 19, 2025Updated last year
- Finetune Sesame's CSM 1B model, for fun and profit☆17Mar 24, 2025Updated last year
- Rust bindings for OpenNMT/CTranslate2☆54May 31, 2026Updated last week
- A Rust-based, SenseVoiceSmall☆33Apr 27, 2026Updated last month
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆59Dec 1, 2024Updated last year
- Frontend for Uplink☆12Apr 22, 2025Updated last year
- ☆16May 14, 2025Updated last year
- ☆15Mar 18, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆65Jun 24, 2025Updated 11 months ago
- A simple, CUDA or CPU powered, library for creating vector embeddings using Candle and models from Hugging Face☆48May 3, 2024Updated 2 years ago
- LLM inference in C/C++☆23Oct 4, 2024Updated last year
- A forward proxy to turn network traffic into personal memory for AI agents☆38Mar 30, 2026Updated 2 months ago
- Efficent platform for inference and serving local LLMs including an OpenAI compatible API server.☆670May 26, 2026Updated 2 weeks ago
- A simple, fast terminal based AI coding assistant☆246May 29, 2026Updated last week
- Service for testing out the new Qwen2.5 omni model☆63Apr 30, 2025Updated last year
- An educational Rust project for exporting and running inference on Qwen3 LLM family☆44Aug 3, 2025Updated 10 months ago
- A pure and fast NumPy implementation of Mamba with cache support.☆18Jun 16, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.☆19Jan 10, 2025Updated last year
- Fast ML inference & training for ONNX models in Rust☆2,313Updated this week
- The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibes☆249Aug 6, 2025Updated 10 months ago
- A collection of experimental Retrieval Augmented Generation (RAG) Techniques to elevate your pipelines, all with code and intuitive expla…☆37Jul 21, 2025Updated 10 months ago
- AI Assistant☆20Feb 21, 2026Updated 3 months ago
- High-level, optionally asynchronous Rust bindings to llama.cpp☆245Jun 5, 2024Updated 2 years ago
- ☆20Jul 4, 2025Updated 11 months ago