teabranch / open-responses-server
Wraps any OpenAI API interface as a Responses API with MCP support so that it works with Codex, adding any missing stateful features. Compatible with Ollama and vLLM.
☆87Updated 2 months ago
Alternatives and similar repositories for open-responses-server
Users who are interested in open-responses-server are comparing it to the libraries listed below
- InferX is an Inference Function as a Service Platform☆132Updated this week
- Shared Memory Storage for Multi-Agent Systems☆123Updated 2 months ago
- llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work☆272Updated 3 weeks ago
- ☆102Updated 3 months ago
- ☆64Updated 9 months ago
- A simple tool to anonymize LLM prompts.☆64Updated 7 months ago
- A lightweight Agentic AI framework which works for Mac/Linux/WSL☆40Updated 2 months ago
- This is a cross-platform desktop application that allows you to chat with locally hosted LLMs and enjoy features like MCP support☆224Updated last month
- MCP to explore websites with llms.txt files☆67Updated 3 months ago
- Documentation site for fast-agent☆19Updated this week
- ☆48Updated last month
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆126Updated 10 months ago
- The easiest & fastest way to run LLMs in your home lab☆66Updated 3 weeks ago
- ✨ mem0 MCP Server: A memory system using mem0 for AI applications with model context protocol (MCP) integration. Enables long-term memory …☆72Updated last month
- Lightweight & fast AI inference proxy for self-hosted LLMs backends like Ollama, LM Studio and others. Designed for speed, simplicity and…☆86Updated last week
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆95Updated 2 months ago
- ☆58Updated 2 months ago
- Self-hosted alternative to OpenAI's Responses API compatible with Agents SDK and works with all model providers (Claude/R1/Qwen/Ollama et…☆81Updated 5 months ago
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and…☆50Updated 3 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆71Updated last year
- Since OpenAI and friends refuse to give us a max_ctx param in /models, here's the current context window, input token and output token li…☆54Updated 4 months ago
- MockLLM, when you want it to do what you tell it to do!☆61Updated last week
- beep boop 🤖 (experimental)☆114Updated 8 months ago
- Call another MCP client from your MCP client. Offload context windows, delegate tasks, split between models☆28Updated 6 months ago
- ☆38Updated 2 months ago
- ☆104Updated 3 months ago
- Screenshot LLM is a Python application that leverages the power of AI to analyze screenshots. Built with PyQt6 for a user-friendly interf…☆45Updated 10 months ago
- ☆24Updated 7 months ago
- Retrieval-augmented generation (RAG) for remote & local LLM use☆45Updated 3 months ago
- An Open Source, Claude Code Like Tool, With RAG + Graph RAG + MCP Integration, and Supports Most LLMs (Incomplete But Functional & Usable…☆110Updated 2 months ago