katanemo / archgw
Arch is an intelligent prompt gateway. Engineered with (fast) LLMs for the secure handling, robust observability, and seamless integration of prompts with your APIs - outside business logic. Built by the core contributors of Envoy proxy, on Envoy.
☆757Updated this week
Related projects ⓘ
Alternatives and complementary repositories for archgw
- Korvus is a search SDK that unifies the entire RAG pipeline in a single database query. Built on top of Postgres with bindings for Python…☆1,281Updated 2 months ago
- 🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library☆1,477Updated this week
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,281Updated this week
- A realtime serving engine for Data-Intensive Generative AI Applications☆918Updated this week
- The fastest way to build robust AI agents☆452Updated this week
- ➖ Stripped down, stable version of firecrawl optimized for self-hosting and ease of contribution. Billing logic and AI features are compl…☆220Updated this week
- Tech Stack for Building, Evaluating, and Deploying your LLM Application☆329Updated last week
- A suite of tools to develop RAG, semantic search, and other AI applications more easily with PostgreSQL☆2,048Updated this week
- Dynamiq is an orchestration framework for agentic AI and LLM applications☆456Updated this week
- Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Manageme…☆869Updated this week
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app☆1,268Updated this week
- Laminar - open-source all-in-one platform for engineering AI products. Crate data flywheel for you AI app. Traces, Evals, Datasets, Label…☆1,152Updated this week
- Stateful load balancer custom-tailored for llama.cpp 🏓🦙☆568Updated this week
- openperplex is an opensource AI search engine☆762Updated 3 months ago
- Structured information extraction from documents☆283Updated last month
- Things you can do with the token embeddings of an LLM☆1,378Updated last week
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit☆718Updated 3 months ago
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆499Updated 3 weeks ago
- Local realtime voice AI☆1,946Updated this week
- Building AI agents, atomically☆1,035Updated this week
- ☆251Updated 3 months ago
- Implementing the 4 agentic patterns from scratch☆765Updated 3 weeks ago
- TypeScript AI agent platform with Autonomous agents, Software developer agents, AI code review agents and more☆837Updated this week
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,468Updated this week
- Flexible and powerful multi-agent AI framework☆315Updated 2 weeks ago
- TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.☆611Updated this week
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.☆282Updated this week
- Optimizing inference proxy for LLMs☆1,582Updated this week
- Build and query dynamic, temporally-aware Knowledge Graphs☆1,391Updated this week
- Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, ev…☆583Updated this week