LlamaEdge / rag-api-serverLinks
A RAG API server written in Rust following OpenAI specs
☆60Updated 9 months ago
Alternatives and similar repositories for rag-api-server
Users that are interested in rag-api-server are comparing it to the libraries listed below
Sorting:
- High-performance framework for building interactive multi-agent workflow systems in Rust☆236Updated 2 months ago
- ChronoMind: Redefining Vector Intelligence Through Time.☆73Updated 9 months ago
- The easiest & fastest way to run customized and fine-tuned LLMs locally or on the edge☆27Updated 10 months ago
- 🦀 A Pure Rust Framework For Building AGI (WIP).☆111Updated 3 weeks ago
- Andrej Karpathy's Let's build GPT: from scratch video & notebook implemented in Rust + candle☆76Updated last year
- Moly AI: A local + cloud AI LLM multi-platform GUI app in pure Rust☆390Updated this week
- This project is a web-based LLM (Large Language Model) chat tool developed using Rust, the Dioxus framework, and the Candle framework. It…☆102Updated last year
- A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust☆79Updated 2 years ago
- A simple, CUDA or CPU powered, library for creating vector embeddings using Candle and models from Hugging Face☆47Updated last year
- An LLM interface (chat bot) implemented in pure Rust using HuggingFace/Candle over Axum Websockets, an SQLite Database, and a Leptos (Was…☆139Updated last year
- The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibes☆242Updated 6 months ago
- Library for doing RAG☆82Updated last month
- A multi-agent framework written in Rust that enables you to build, deploy, and coordinate multiple intelligent agents☆329Updated this week
- A Pure Rust based LLM, VLM, VLA, TTS, OCR Inference Engine, powering by Candle & Rust. Alternate to your llama.cpp but much more simpler …☆244Updated last week
- A Fish Speech implementation in Rust, with Candle.rs☆108Updated 8 months ago
- Minimalistic Rust Implementation Of Model Context Protocol from Anthropic☆63Updated 6 months ago
- Extract core logic from qdrant and make it available as a library.☆63Updated last year
- Fast serverless LLM inference, in Rust.☆109Updated 3 months ago
- Rust bindings to https://github.com/k2-fsa/sherpa-onnx☆287Updated 3 months ago
- llm_utils: Basic LLM tools, best practices, and minimal abstraction.☆48Updated 11 months ago
- Efficent platform for inference and serving local LLMs including an OpenAI compatible API server.☆587Updated last week
- Fast, streaming indexing, query, and agentic LLM applications in Rust☆656Updated this week
- Model Context Protocol (MCP) implementation in Rust☆352Updated 10 months ago
- Use multiple LLM backends in a single crate, simple builder-based configuration, and built-in prompt chaining & templating.☆140Updated 8 months ago
- AI gateway and observability server written in Rust. Designed to help optimize multi-agent workflows.☆65Updated last year
- The Google mediapipe AI library. Write AI inference applications for image recognition, text classification, audio / video processing and…☆227Updated 3 weeks ago
- Rust application development framework for native and web applications☆72Updated 7 months ago
- A comprehensive Rust translation of the code from Sebastian Raschka's Build an LLM from Scratch book.☆295Updated this week
- Rust implementation of Surya☆65Updated 11 months ago
- A powerful Rust library and CLI tool to unify and orchestrate multiple LLM, Agent and voice backends (OpenAI, Claude, Gemini, Ollama, Ele…☆303Updated this week