extopico / llama-server_mcp_proxyLinks
Simple node proxy for llama-server that enables MCP use
☆17Updated 9 months ago
Alternatives and similar repositories for llama-server_mcp_proxy
Users that are interested in llama-server_mcp_proxy are comparing it to the libraries listed below
Sorting:
- The most feature-complete local AI workstation. Multi-GPU inference, integrated Stable Diffusion + ADetailer, voice cloning, research-gra…☆55Updated last week
- Llama.cpp runner/swapper and proxy that emulates LMStudio / Ollama backends☆51Updated 5 months ago
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :)☆17Updated this week
- ☆17Updated last year
- ☆19Updated 7 months ago
- Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!☆29Updated 2 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆72Updated last year
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆88Updated last week
- Open source tool for transcirption and subtitling, alternative to happyscribe.☆33Updated 11 months ago
- Autonomous, agentic, creative story writing system that incorporates stored embeddings and Knowledge Graphs.☆92Updated this week
- ☆24Updated last year
- ☆109Updated 5 months ago
- Python language chat with Ollama models locally, anthropic and openai☆24Updated 9 months ago
- Running Microsoft's BitNet via Electron, React & Astro☆52Updated 4 months ago
- ☆51Updated 11 months ago
- Capture, tag, and search images locally with OSS models.☆44Updated last year
- Web application for roleplaying with AI-powered characters☆67Updated 7 months ago
- ☆90Updated 2 months ago
- ☆15Updated 10 months ago
- Chatbot-to-speech using Orpheus TTS model. Interactive console app.☆21Updated 9 months ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆86Updated last year
- A real-time shared memory layer for multi-agent LLM systems.☆53Updated last month
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆29Updated last year
- ☆18Updated 5 months ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆127Updated last year
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …☆103Updated 5 months ago
- A stock market bot that automatically, once a day, rebalances your Robinhood portfolio by gathering information about each ticker in the …☆60Updated 11 months ago
- Cognito: Supercharge your Chrome browser with AI. Guide, query, and control everything using natural language.☆57Updated last month
- Orpheus Chat WebUI☆76Updated 10 months ago
- 🚀 FlexLLama - Lightweight self-hosted tool for running multiple llama.cpp server instances with OpenAI v1 API compatibility and multi-GP…☆50Updated 2 months ago