Wraps any OpenAI-compatible API as a Responses API with MCP support so that it works with Codex, adding any missing stateful features. Ollama and vLLM compliant.
☆174 · Apr 8, 2026 · Updated last month
Alternatives and similar repositories for open-responses-server
Users who are interested in open-responses-server are comparing it to the libraries listed below.
- An MCP server that scales development into controllable agentic, recursive flows, and build a feature from bottom-up ☆46 · Jun 29, 2025 · Updated 10 months ago
- ☆24 · Jan 22, 2025 · Updated last year
- Instructions for setting up SuperGateway MCP servers in docker containers for docker deployments of LibreChat ☆24 · Feb 7, 2025 · Updated last year
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI ☆60 · Dec 1, 2024 · Updated last year
- ☆10 · Feb 23, 2025 · Updated last year
- Embodied AI system combining real-time multimodal perception, speech-to-speech interaction, and autonomous awareness on NVIDIA Jetson har… ☆228 · Apr 8, 2026 · Updated last month
- Self-hosted alternative to OpenAI's Responses API compatible with Agents SDK and works with all model providers (Claude/R1/Qwen/Ollama et… ☆219 · Apr 2, 2025 · Updated last year
- Use Discord as your interface for ollama ☆12 · Jan 30, 2024 · Updated 2 years ago
- Fast LLM swapping with sleep/wake support, compatible with vllm, llama.cpp, etc. llama-swap fork. ☆42 · Apr 5, 2026 · Updated last month
- An MCP Server to enable global access to Rememberizer ☆35 · Apr 17, 2026 · Updated last month
- A forward proxy to turn network traffic into personal memory for AI agents ☆38 · Mar 30, 2026 · Updated last month
- Multi-GPU device selection for LTXV2 video generation in ComfyUI ☆30 · Jan 10, 2026 · Updated 4 months ago
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :) ☆16 · Mar 6, 2026 · Updated 2 months ago
- This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work. ☆33 · Oct 3, 2024 · Updated last year
- Kubernetes operator for local LLM inference with llama.cpp, vLLM, and TGI - multi-GPU, autoscaling, air-gapped, production-ready ☆88 · Updated this week
- Digital Assistant for Workflow Neural-inference (AI Assistant) ☆19 · Updated this week
- General Tool-calling API Proxy ☆59 · Mar 26, 2026 · Updated last month
- HA Komodo integration ☆32 · May 15, 2026 · Updated last week
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others. ☆35 · May 11, 2026 · Updated last week
- Hill Space is All You Need ☆17 · Jul 11, 2025 · Updated 10 months ago
- The High Performance LLM Native Mock Server ☆26 · Apr 26, 2026 · Updated 3 weeks ago
- Fork of the javax.media.j3d package ☆12 · May 6, 2026 · Updated 2 weeks ago
- Port of Alex's Mobs to Fabric. Has some changes for Crimecraft S3, so don't expect this to be 1:1. ☆14 · Dec 15, 2025 · Updated 5 months ago
- Your Interface to Intelligence ☆49 · Apr 23, 2026 · Updated last month
- Various scripts for working with local LLMs ☆16 · Oct 19, 2023 · Updated 2 years ago
- Unified API platform for free access to enterprise-grade AI models from Google, Groq, and OpenRouter. Industrial-ready integration with h… ☆13 · Mar 14, 2025 · Updated last year
- Open-source iOS app connecting Meta Ray-Ban smart glasses to AI assistants (OpenClaw + Gemini Live) ☆44 · Apr 6, 2026 · Updated last month
- A simple, "Ollama-like" tool for managing and running GGUF language models from your terminal. ☆24 · Jan 2, 2026 · Updated 4 months ago
- A Pure Rust based LLM, VLM, VLA, TTS, OCR Inference Engine, powering by Candle & Rust. Alternate to your llama.cpp but much more simpler … ☆391 · May 4, 2026 · Updated 2 weeks ago
- A fork of the OpenGL® Registry for code generation tools and upstreaming proposed fixes. ☆16 · Oct 31, 2019 · Updated 6 years ago
- Fork of Triton repository for OpenXLA uses of the Triton language and compiler ☆15 · Feb 24, 2026 · Updated 2 months ago
- chrome & firefox extension to chat with webpages: local llms ☆130 · Dec 20, 2024 · Updated last year
- Some code and extra information about the paper "Continued pre-training of LLMs for Portuguese and Government domain: A proposal for prod… ☆10 · Mar 9, 2024 · Updated 2 years ago
- Ja[va] C call[ing] ☆12 · May 4, 2017 · Updated 9 years ago
- a discord user installable app for ai response using ollama, JDA-5 and LangChain4J ☆21 · Jan 9, 2025 · Updated last year
- ☆16 · Dec 29, 2018 · Updated 7 years ago
- Master thesis work: explaining deep reinforcement learning policies ☆10 · Aug 27, 2020 · Updated 5 years ago
- A python framework to streamline your ARC challenge solutions. From graphical displays to optimized Kaggle submissions ☆13 · Oct 17, 2024 · Updated last year
- win32 native frontend for llama-cli ☆14 · Nov 2, 2024 · Updated last year