Noveum / ai-gateway
Built for demanding AI workflows, this gateway offers low-latency, provider-agnostic access, ensuring your AI applications run smoothly and quickly.
☆78 Updated 4 months ago
Alternatives and similar repositories for ai-gateway
Users who are interested in ai-gateway are comparing it to the libraries listed below.
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust programming, using cargo as feedback ☆105 Updated 6 months ago
- Kheish: A multi-role LLM agent for tasks like code auditing, file searching, and more, seamlessly leveraging RAG and extensible modules. ☆141 Updated 9 months ago
- Rust implementation of Surya ☆60 Updated 7 months ago
- Git-like RAG pipeline ☆245 Updated last week
- Fast serverless LLM inference, in Rust. ☆93 Updated 7 months ago
- High-performance framework for building interactive multi-agent workflow systems in Rust ☆146 Updated 3 weeks ago
- The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibes ☆237 Updated 2 months ago
- Official Rust Implementation of Model2Vec ☆138 Updated last week
- Fast Rust MCP proxy between stdio and SSE ☆22 Updated 3 months ago
- A DSPy rewrite (not port) to Rust ☆104 Updated this week
- Self-hosted alternative to OpenAI's Responses API compatible with Agents SDK and works with all model providers (Claude/R1/Qwen/Ollama et… ☆88 Updated 6 months ago
- Super-simple, fully Rust-powered "memory" (doc store + semantic search) for LLM projects, semantic search, etc. ☆62 Updated last year
- Augment Swarm with durable execution to help you build reliable and scalable multi-agent systems. ☆107 Updated 11 months ago
- AI Assistant ☆20 Updated 5 months ago
- High-scale LLM gateway, written in Rust. OpenTelemetry-based observability included ☆136 Updated 3 weeks ago
- ⚡️ Lightning-fast in-memory VectorDB written in Rust 🦀 ☆25 Updated 6 months ago
- A simple CUDA- or CPU-powered library for creating vector embeddings using Candle and models from Hugging Face ☆41 Updated last year
- A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust ☆79 Updated last year
- LLM-as-SERP ☆70 Updated 7 months ago
- Blockoli is a high-performance tool for code indexing, embedding generation, and semantic search for use with LLMs. ☆152 Updated last year
- The MCP enterprise actors-based server, or mcp-ectors for short ☆31 Updated 4 months ago
- A pure-Rust LLM (and any LLM-based MLLM, such as Spark-TTS) inference engine, powered by the Candle framework. ☆163 Updated last week
- Use multiple LLM backends in a single crate, simple builder-based configuration, and built-in prompt chaining & templating. ☆137 Updated 4 months ago
- Run AI-generated code in isolated sandboxes ☆109 Updated 8 months ago
- ☆139 Updated last year
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust ☆60 Updated 5 months ago
- An educational Rust project for exporting and running inference on Qwen3 LLM family ☆29 Updated 2 months ago
- An AI agent library using Python as the common language to define executable actions and tool interfaces. ☆86 Updated last month
- Library for doing RAG ☆77 Updated 3 weeks ago
- ChronoMind: Redefining Vector Intelligence Through Time. ☆72 Updated 5 months ago