LlamaEdge / rag-api-serverLinks

A RAG API server written in Rust following OpenAI specs

☆52

Alternatives and similar repositories for rag-api-server

Users that are interested in rag-api-server are comparing it to the libraries listed below

Sorting:

ShelbyJenkins / candle_embed
A simple, CUDA or CPU powered, library for creating vector embeddings using Candle and models from Hugging Face
☆37Updated last year
IncredibleDevHQ / agent-panel
AI gateway and observability server written in Rust. Designed to help optimize multi-agent workflows.
☆61Updated 11 months ago
LlamaEdge / whisper-api-server
The easiest & fastest way to run customized and fine-tuned LLMs locally or on the edge
☆15Updated 3 months ago
a-agmon / rs-graph-llm
Stateful and interruptible graph execution framework for interactive agentic workflows, inspired by LangGraph but built from the ground u…
☆56Updated this week
kevin-rs / autogpt
🦀 A Pure Rust Framework For Building AGI (WIP).
☆82Updated this week
JtPerez-Acle / chrono-mind
ChronoMind: Redefining Vector Intelligence Through Time.
☆71Updated last month
JackMatthewRimmer / rust-rag-toolchain
Library for doing RAG
☆74Updated last month
AntigmaLabs / mcp-sdk
Minimalistic Rust Implementation Of Model Context Protocol from Anthropic
☆56Updated 3 months ago
Lyn-liyuan / moonweb
This project is a web-based LLM (Large Language Model) chat tool developed using Rust, the Dioxus framework, and the Candle framework. It…
☆94Updated 10 months ago
ShelbyJenkins / llm_utils
llm_utils: Basic LLM tools, best practices, and minimal abstraction.
☆46Updated 4 months ago
isala404 / Tera
Tera is an AI assistant which is tailored just for you and runs fully locally.
☆84Updated last year
tyrchen / qdrant-lib
Extract core logic from qdrant and make it available as a library.
☆59Updated last year
danielclough / fireside-chat
An LLM interface (chat bot) implemented in pure Rust using HuggingFace/Candle over Axum Websockets, an SQLite Database, and a Leptos (Was…
☆134Updated 8 months ago
moxin-org / moly
Moly: A Desktop + Cloud AI LLM GUI app in pure Rust
☆296Updated last week
mcp-ectors / mcp-ectors
The MCP enterprise actors-based server or mcp-ectors for short
☆31Updated 3 weeks ago
a-agmon / doc-embedder
A high-performance RAG indexing pipeline implemented in Rust using LanceDB and Candle
☆18Updated 10 months ago
lispking / tokio-mpmc
A multi-producer multi-consumer queue implementation based on Tokio
☆39Updated 2 weeks ago
workflow-rs / workflow-rs
Rust application development framework for native and web applications
☆63Updated last week
thewh1teagle / sherpa-rs
Rust bindings to https://github.com/k2-fsa/sherpa-onnx
☆190Updated last month
graniet / rllm
Use multiple LLM backends in a single crate, simple builder-based configuration, and built-in prompt chaining & templating.
☆132Updated last month
WasmEdge / wasmedge-rust-sdk
Embed WasmEdge functions in a Rust host app
☆32Updated 6 months ago
jordandelbar / yolo-tonic
Webcam video stream with real-time YOLO object detection, built with Ort, Tonic and Axum.
☆78Updated last month
danielgrittner / llama2-rs
LLaMA2 + Rust
☆12Updated last year
jkawamoto / ctranslate2-rs
Rust bindings for OpenNMT/CTranslate2
☆33Updated 2 months ago
ShelbyJenkins / llm_client
The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibes
☆203Updated 4 months ago
pixelspark / poly
A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust
☆80Updated last year
linux-china / mcp-rs-template
Model Context Protocol (MCP) CLI server template for Rust
☆78Updated 2 months ago
viniciusf-dev / nebulla
A lightweight, high-performance text embedding model implemented in Rust.
☆66Updated last month
DioxusLabs / dioxus-ai
☆37Updated 7 months ago
restsend / rustpbx
A PBX written by rust
☆77Updated this week