EricLBuehler / mistral.rs

Blazingly fast LLM inference.

☆6,149

Alternatives and similar repositories for mistral.rs

Users who are interested in mistral.rs compare it to the libraries listed below.

luminal-ai / luminal
Deep learning at the speed of light.
☆2,564 Updated last week
menloresearch / cortex.cpp
Local AI API Platform
☆2,758 Updated 3 months ago
rustformers / llm
[Unmaintained, see README] An ecosystem of Rust libraries for working with large language models
☆6,134 Updated last year
floneum / floneum
Instant, controllable, local pre-trained AI models in Rust
☆2,041 Updated this week
bionic-gpt / bionic-gpt
Bionic is an on-premise replacement for ChatGPT, offering the advantages of Generative AI while maintaining strict data confidentiality
☆2,265 Updated this week
huggingface / candle
Minimalist ML framework for Rust
☆18,341 Updated this week
asg017 / sqlite-vec
A vector search SQLite extension that runs anywhere!
☆6,300 Updated 9 months ago
evilsocket / cake
Distributed LLM and StableDiffusion inference for mobile, desktop and server.
☆2,886 Updated last year
lancedb / lancedb
Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less.
☆7,762 Updated last week
tracel-ai / burn
Burn is a next generation Deep Learning Framework that doesn't compromise on flexibility, efficiency and portability.
☆13,140 Updated this week
intentee / paddler
Open-source LLM load balancer and serving platform for self-hosting LLMs at scale 🏓🦙
☆1,337 Updated 2 weeks ago
LlamaEdge / LlamaEdge
The easiest & fastest way to run customized and fine-tuned LLMs locally or on the edge
☆1,516 Updated last week
pytorch / torchchat
Run PyTorch LLMs locally on servers, desktop and mobile
☆3,615 Updated last month
huggingface / text-embeddings-inference
A blazing fast inference solution for text embeddings models
☆4,103 Updated 2 weeks ago
cohere-ai / cohere-toolkit
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.
☆3,133 Updated 2 weeks ago
microsoft / aici
AICI: Prompts as (Wasm) Programs
☆2,050 Updated 9 months ago
pykeio / ort
Fast ML inference & training for ONNX models in Rust
☆1,642 Updated last week
srush / llama2.rs
A fast llama2 decoder in pure Rust.
☆1,052 Updated last year
Abraxas-365 / langchain-rust
🦜️🔗LangChain for Rust, the easiest way to write LLM-based programs in Rust
☆1,116 Updated 3 weeks ago
b4rtaz / distributed-llama
Distributed LLM inference. Connect home devices into a powerful cluster to accelerate LLM inference. More devices means faster inference.
☆2,705 Updated 2 weeks ago
0xPlaygrounds / rig
⚙️🦀 Build modular and scalable LLM Applications in Rust
☆4,717 Updated this week
ml-explore / mlx-examples
Examples in the MLX framework
☆7,926 Updated 2 weeks ago
tairov / llama2.mojo
Inference Llama 2 in one file of pure 🔥
☆2,118 Updated last week
n0-computer / iroh
peer-2-peer that just works
☆7,045 Updated last week
LaurentMazare / tch-rs
Rust bindings for the C++ api of PyTorch.
☆5,073 Updated last week
ggozad / oterm
the terminal client for Ollama
☆2,219 Updated last week
samuel-vitorino / lm.rs
Minimal LLM inference in Rust
☆1,013 Updated last year
lavague-ai / LaVague
Large Action Model framework to develop AI Web Agents
☆6,184 Updated 9 months ago
predibase / lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
☆3,518 Updated 5 months ago
pepperoni21 / ollama-rs
A simple and easy-to-use library for interacting with the Ollama API.
☆924 Updated last week