A pure Rust-based LLM, VLM, VLA, TTS, and OCR inference engine, powered by Candle & Rust. An alternative to llama.cpp, but much simpler and cleaner.
☆412 · Jun 25, 2026 · Updated last week
Alternatives and similar repositories for Crane
Users that are interested in Crane are comparing it to the libraries listed below.
- Rust standalone inference of Namo-500M series models. Extremely tiny, running VLM on CPU. ☆24 · Mar 12, 2025 · Updated last year
- Rust inference implementation of Kokoro TTS ☆33 · Jun 1, 2026 · Updated last month
- 🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast, real-time, high-quality TTS. ☆793 · Jun 19, 2026 · Updated last week
- ☆24 · Jan 22, 2025 · Updated last year
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in C now. Fast and clean from the start this time :) ☆15 · Jun 16, 2026 · Updated 2 weeks ago
- A Rust 🦀 port of the Hugging Face smolagents library. ☆43 · Mar 26, 2025 · Updated last year
- Run Orpheus 3B Locally with Gradio UI, Standalone App ☆24 · Apr 1, 2025 · Updated last year
- Fast, flexible LLM inference ☆7,362 · Jun 25, 2026 · Updated last week
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G… ☆26 · Mar 28, 2025 · Updated last year
- Rust bindings to https://github.com/k2-fsa/sherpa-onnx ☆310 · Mar 8, 2026 · Updated 3 months ago
- ☆599 · Updated this week
- The Python Implementation of CRISP: Clustering Multi-Vector Representations for Denoising and Pruning ☆27 · Jul 27, 2025 · Updated 11 months ago
- Implementation of Qwen3-ASR-0.6B in GGML ☆99 · Feb 10, 2026 · Updated 4 months ago
- Graph model execution API for Candle ☆18 · Jul 27, 2025 · Updated 11 months ago
- ☆51 · Feb 19, 2025 · Updated last year
- Finetune Sesame's CSM 1B model, for fun and profit ☆17 · Mar 24, 2025 · Updated last year
- Rust bindings for OpenNMT/CTranslate2 ☆57 · Jun 20, 2026 · Updated last week
- the rent a hal project for AI ☆23 · Jun 25, 2026 · Updated last week
- The heart of The Pulsar App, fast, secure and shared inference with modern UI ☆59 · Dec 1, 2024 · Updated last year
- Frontend for Uplink ☆12 · Apr 22, 2025 · Updated last year
- ☆15 · Mar 18, 2026 · Updated 3 months ago
- ☆65 · Jun 24, 2025 · Updated last year
- A simple, CUDA or CPU powered, library for creating vector embeddings using Candle and models from Hugging Face ☆48 · May 3, 2024 · Updated 2 years ago
- A forward proxy to turn network traffic into personal memory for AI agents ☆38 · Mar 30, 2026 · Updated 3 months ago
- Efficient platform for inference and serving local LLMs including an OpenAI-compatible API server. ☆682 · Updated this week
- A simple, fast terminal-based AI coding assistant ☆246 · Jun 24, 2026 · Updated last week
- An educational Rust project for exporting and running inference on Qwen3 LLM family ☆44 · Aug 3, 2025 · Updated 10 months ago
- A pure and fast NumPy implementation of Mamba with cache support. ☆18 · Jun 16, 2024 · Updated 2 years ago
- Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices. ☆20 · Jan 10, 2025 · Updated last year
- superfast text to speech in any voice ☆62 · Feb 16, 2026 · Updated 4 months ago
- The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibes ☆253 · Aug 6, 2025 · Updated 10 months ago
- ☆19 · Aug 19, 2025 · Updated 10 months ago
- A collection of experimental Retrieval Augmented Generation (RAG) Techniques to elevate your pipelines, all with code and intuitive expla… ☆37 · Jul 21, 2025 · Updated 11 months ago
- ☆179 · Aug 10, 2025 · Updated 10 months ago
- AI Assistant ☆21 · Feb 21, 2026 · Updated 4 months ago
- ☆49 · Mar 17, 2025 · Updated last year
- High-level, optionally asynchronous Rust bindings to llama.cpp ☆246 · Jun 5, 2024 · Updated 2 years ago
- ☆20 · Jul 4, 2025 · Updated 11 months ago
- A lightweight LLaMA.cpp HTTP server Docker image based on Alpine Linux. ☆39 · Jun 8, 2026 · Updated 3 weeks ago