Noveum / ai-gateway
Built for demanding AI workflows, this gateway offers low-latency, provider-agnostic access, ensuring your AI applications run smoothly and quickly.
☆70 Updated last month
Alternatives and similar repositories for ai-gateway
Users who are interested in ai-gateway compare it to the libraries listed below.
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback ☆97 Updated 4 months ago
- Rust implementation of Surya ☆58 Updated 4 months ago
- Kheish: A multi-role LLM agent for tasks like code auditing, file searching, and more seamlessly leveraging RAG and extensible modules. ☆141 Updated 6 months ago
- git-like rag pipeline ☆233 Updated this week
- A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust ☆80 Updated last year
- The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibes ☆205 Updated 4 months ago
- Super-simple, fully Rust powered "memory" (doc store + semantic search) for LLM projects, semantic search, etc. ☆62 Updated last year
- AI Assistant ☆20 Updated 2 months ago
- Official Rust Implementation of Model2Vec ☆122 Updated last week
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas… ☆187 Updated 3 weeks ago
- LLM-as-SERP ☆66 Updated 4 months ago
- Multi-language code navigation API in a container ☆84 Updated 2 weeks ago
- A simple, CUDA or CPU powered, library for creating vector embeddings using Candle and models from Hugging Face ☆37 Updated last year
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust ☆55 Updated 2 months ago
- Fast serverless LLM inference, in Rust. ☆88 Updated 4 months ago
- Run AI generated code in isolated sandboxes ☆88 Updated 5 months ago
- Build tools for LLMs in Rust using Model Context Protocol ☆38 Updated 4 months ago
- Library for doing RAG ☆74 Updated last month
- ☆138 Updated last year
- A Pure Rust based LLM (Any LLM based MLLM such as Spark-TTS) Inference Engine, powering by Candle framework. ☆131 Updated last month
- Light WebUI for lm.rs ☆24 Updated 9 months ago
- ⚡️Lightning fast in-memory VectorDB written in rust🦀 ☆22 Updated 4 months ago
- graph + ai in your products; reduce costs and get correct answers from your data ☆47 Updated 2 weeks ago
- High-scale LLM gateway, written in Rust. OpenTelemetry-based observability included ☆110 Updated this week
- Public API documentation from dependencies for AI coding assistants ☆37 Updated 5 months ago
- Semantic caching layer for your LLM applications. Reuse responses and reduce token usage. ☆83 Updated 3 weeks ago
- llm_utils: Basic LLM tools, best practices, and minimal abstraction. ☆46 Updated 4 months ago
- OpenAI compatible API for serving LLAMA-2 model ☆218 Updated last year
- Augment Swarm with durable execution to help you build reliable and scalable multi-agent systems. ☆100 Updated 8 months ago
- The MCP enterprise actors-based server or mcp-ectors for short ☆31 Updated last month