Noveum / ai-gatewayLinks
Built for demanding AI workflows, this gateway offers low-latency, provider-agnostic access, ensuring your AI applications run smoothly and quickly.
☆73Updated 2 months ago
Alternatives and similar repositories for ai-gateway
Users that are interested in ai-gateway are comparing it to the libraries listed below
Sorting:
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback☆101Updated 5 months ago
- Kheish: A multi-role LLM agent for tasks like code auditing, file searching, and more seamlessly leveraging RAG and extensible modules.☆141Updated 7 months ago
- A DSPy rewrite to(not port) Rust☆56Updated last week
- git-like rag pipeline☆243Updated 3 weeks ago
- Rust implementation of Surya☆60Updated 5 months ago
- A simple, CUDA or CPU powered, library for creating vector embeddings using Candle and models from Hugging Face☆37Updated last year
- Super-simple, fully Rust powered "memory" (doc store + semantic search) for LLM projects, semantic search, etc.☆62Updated last year
- Fast serverless LLM inference, in Rust.☆88Updated 5 months ago
- The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibes☆229Updated 2 weeks ago
- AI Assistant☆20Updated 4 months ago
- LLM-as-SERP☆69Updated 5 months ago
- ⚡️Lightning fast in-memory VectorDB written in rust🦀☆23Updated 5 months ago
- Augment Swarm with durable execution to help you build reliable and scalable multi-agent systems.☆105Updated 9 months ago
- Official Rust Implementation of Model2Vec☆124Updated last month
- Self-hosted alternative to OpenAI's Responses API compatible with Agents SDK and works with all model providers (Claude/R1/Qwen/Ollama et…☆75Updated 4 months ago
- Light WebUI for lm.rs☆24Updated 10 months ago
- A memory framework for Large Language Models and Agents.☆182Updated 7 months ago
- Blockoli is a high-performance tool for code indexing, embedding generation and semantic search tool for use with LLMs.☆141Updated last year
- High-scale LLM gateway, written in Rust. OpenTelemetry-based observability included☆124Updated 2 weeks ago
- Semantic caching layer for your LLM applications. Reuse responses and reduce token usage.☆85Updated 2 months ago
- George is an API leveraging AI to make it easy to control a computer with natural language.☆48Updated 7 months ago
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust☆58Updated 4 months ago
- An educational Rust project for exporting and running inference on Qwen3 LLM family☆27Updated 3 weeks ago
- High-performance framework for building interactive multi-agent workflow systems in Rust☆122Updated 3 weeks ago
- LLMap solves context extraction for large codebases☆108Updated 6 months ago
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆92Updated last month
- A lightweight Agentic AI framework which works for Mac/Linux/WSL☆40Updated last month
- OpenAI compatible API for serving LLAMA-2 model☆218Updated last year
- Split code into semantic chunks☆47Updated 11 months ago
- Library for doing RAG☆75Updated 2 weeks ago