robbyt / llm_proxy
AI aware proxy
☆18Updated 6 months ago
Alternatives and similar repositories for llm_proxy:
Users that are interested in llm_proxy are comparing it to the libraries listed below
- Knowledge for GPTScript☆29Updated 4 months ago
- Run Structured LLM Inference with Easy Parallelism☆15Updated last month
- Vector Embedding Server in under 100 lines of code☆22Updated last year
- QLLM: A powerful CLI for seamless interaction with multiple Large Language Models. Simplify AI workflows, streamline development, and unl…☆33Updated 3 weeks ago
- The home of official Obot tools☆22Updated this week
- A library for generating structured JSON using GPT-4o.☆13Updated 7 months ago
- Document parser for RAG☆22Updated 4 months ago
- Chew is a Go library for processing various content types into markdown/plaintext.☆41Updated last month
- docker-as-code compiler☆13Updated 5 months ago
- Open Source LLM proxy that transparently captures and logs all interactions with LLM API☆52Updated 2 months ago
- convert natural language into technical diagrams☆12Updated 3 months ago
- A simple github actions script to build a llamafile and uploads to huggingface☆14Updated last year
- ☆16Updated last year
- GO GO PARSE YOUR CODE GO GO☆11Updated last month
- AI Testing Agent: Open Source AI Agent for Software Testing☆14Updated 3 months ago
- 📡 Deploy AI models and apps to Kubernetes without developing a hernia☆32Updated 9 months ago
- Python module for running GPTScript☆14Updated this week
- 🦄 Use GPT to generate and label data☆25Updated 10 months ago
- The DPAB-α Benchmark☆19Updated 2 months ago
- Chatroom app where messages are sent to GPT, Claude, Mistral, Together, Grok, Groq, Google, vLLM, Ollama & streamed to the frontend.☆39Updated last week
- ☆19Updated 5 months ago
- Agents and RAG workflows with little to no code☆22Updated 3 months ago
- llm plugin for Cerebras fast inference API☆23Updated last week
- Guards and protection agnostic to your model or provider☆36Updated 4 months ago
- Structured outputs from DSPy and Jinja2☆23Updated 2 months ago
- iauto is a low-code engine for building and deploying AI agents☆85Updated 4 months ago
- The reliability layer between your code and LLM providers.☆17Updated 2 months ago
- Supervised fine-tuning of Google's open-source Gemma-2B model to optimize writing Python code☆21Updated last year