EricLBuehler / mistral.rs
Blazingly fast LLM inference.
☆5,601Updated this week
Alternatives and similar repositories for mistral.rs
Users that are interested in mistral.rs are comparing it to the libraries listed below
Sorting:
- Deep learning at the speed of light.☆1,542Updated 2 weeks ago
- A vector search SQLite extension that runs anywhere!☆5,568Updated 3 months ago
- Local AI API Platform☆2,655Updated last week
- Minimalist ML framework for Rust☆17,161Updated this week
- Burn is a next generation Deep Learning Framework that doesn't compromise on flexibility, efficiency and portability.☆11,044Updated this week
- A blazing fast inference solution for text embeddings models☆3,543Updated last week
- AICI: Prompts as (Wasm) Programs☆2,021Updated 3 months ago
- [Unmaintained, see README] An ecosystem of Rust libraries for working with large language models☆6,114Updated 10 months ago
- ☆2,939Updated 8 months ago
- tiny vision language model☆7,934Updated last month
- Minimal LLM inference in Rust☆983Updated 6 months ago
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆22,551Updated this week
- PyTorch native post-training library☆5,171Updated this week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,045Updated 2 weeks ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆2,972Updated last week
- Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models☆2,746Updated 4 months ago
- Tools for merging pretrained large language models.☆5,646Updated this week
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,166Updated this week
- Distributed LLM and StableDiffusion inference for mobile, desktop and server.☆2,851Updated 6 months ago
- The easiest & fastest way to run customized and fine-tuned LLMs locally or on the edge☆1,384Updated this week
- SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersec…☆15,787Updated this week
- BionicGPT is an on-premise replacement for ChatGPT, offering the advantages of Generative AI while maintaining strict data confidentialit…☆2,159Updated last week
- SGLang is a fast serving framework for large language models and vision language models.☆14,188Updated this week
- Distribute and run LLMs with a single file.☆22,365Updated 3 weeks ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆3,935Updated 9 months ago
- the terminal client for Ollama☆1,798Updated this week
- A fast llama2 decoder in pure Rust.☆1,051Updated last year
- LSP-AI is an open-source language server that serves as a backend for AI-powered functionality, designed to assist and empower software e…☆2,751Updated 4 months ago
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI☆21,830Updated 2 weeks ago
- Structured Text Generation☆11,560Updated this week