Simple node proxy for llama-server that enables MCP use
☆17May 10, 2025Updated 9 months ago
Alternatives and similar repositories for llama-server_mcp_proxy
Users that are interested in llama-server_mcp_proxy are comparing it to the libraries listed below
Sorting:
- ☆17Dec 16, 2024Updated last year
- ☆15Apr 9, 2025Updated 10 months ago
- Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.☆18Jan 10, 2025Updated last year
- Understand Any Repo in seconds. Instantly generate AI-ready code digests, visualize repository structures, and chat with your codebase us…☆23Jun 25, 2025Updated 8 months ago
- ☆16May 8, 2025Updated 9 months ago
- Llama.cpp runner/swapper and proxy that emulates LMStudio / Ollama backends☆52Aug 21, 2025Updated 6 months ago
- Offline LLM chatbot with personalized memory — works on CPU with multi-session memory support.☆22Jan 10, 2026Updated last month
- ☆19Jul 4, 2025Updated 8 months ago
- a browser gui for nvidia smi☆20Mar 17, 2025Updated 11 months ago
- ☆24Aug 26, 2025Updated 6 months ago
- TLS & API keys for your LLM APIs☆20Dec 17, 2025Updated 2 months ago
- A repo for generating educational presentation videos.☆27May 13, 2025Updated 9 months ago
- A forward proxy to turn network traffic into personal memory for AI agents☆36Feb 23, 2026Updated last week
- Python language chat with Ollama models locally, anthropic and openai☆24Apr 13, 2025Updated 10 months ago
- Local RAG as a simple CLI, for standalone use or as a gptme tool☆49Feb 22, 2026Updated last week
- Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!☆29Dec 11, 2025Updated 2 months ago
- Locally hosted AI Agent Python Tool To Generate Novel Research Hypothesis + Titles + Abstracts☆30Apr 30, 2025Updated 10 months ago
- Random llm scripts☆37Feb 25, 2026Updated last week
- Create text chunks which end at natural stopping points without using a tokenizer☆26Nov 26, 2025Updated 3 months ago
- Analyze Reddit posts☆30Feb 27, 2025Updated last year
- An AI Vision Language Model System for extracting structured knowledge graph information(JSON) from images of process diagrams☆40Apr 5, 2025Updated 10 months ago
- Autonomous, agentic, creative story writing system that incorporates stored embeddings and Knowledge Graphs.☆95Feb 16, 2026Updated 2 weeks ago
- The most feature-complete local AI workstation. Multi-GPU inference, integrated Stable Diffusion + ADetailer, voice cloning, research-gra…☆56Feb 24, 2026Updated last week
- Open source tool for transcirption and subtitling, alternative to happyscribe.☆33Feb 12, 2025Updated last year
- Genertaes control vectors for use with llama.cpp in GGUF format.☆38Mar 19, 2025Updated 11 months ago
- A lightweight LLaMA.cpp HTTP server Docker image based on Alpine Linux.☆30Oct 3, 2025Updated 5 months ago
- llama-swap + a minimal ollama compatible api☆51Feb 13, 2026Updated 2 weeks ago
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆29Mar 15, 2025Updated 11 months ago
- interactive semantic search demo using Qwen3-0.6B-Embedding in your browser☆55Feb 25, 2026Updated last week
- AdaLLM is an NVFP4-first inference runtime for Ada Lovelace (RTX 4090) with FP8 KV cache and custom decode kernels. This repo targets NVF…☆96Feb 15, 2026Updated 2 weeks ago
- 🚀 FlexLLama - Lightweight self-hosted tool for running multiple llama.cpp server instances with OpenAI v1 API compatibility and multi-GP…☆50Feb 17, 2026Updated 2 weeks ago
- Finally, an open source Youtube Summarizer extension☆78Apr 22, 2025Updated 10 months ago
- Text to audio with Tik-Tok Voices☆13Apr 6, 2023Updated 2 years ago
- A blueprint for next-gen AI. Project Infinity uses a token-efficient, Codified Agent Protocol to create specialized, secure, and imaginat…☆25Oct 2, 2025Updated 5 months ago
- LexiCrawler is a powerful Go-based web crawling API meticulously designed to extract, clean, and transform web page content into a pristi…☆48Feb 27, 2025Updated last year
- Efforts toward giving Qwen 3 Coder 30B A3B proper agentic tool calling capabilities at or near 100% reliability.☆65Aug 10, 2025Updated 6 months ago
- An example project that demonstrates the brand new and upcoming physics related features of the plugin.☆12Feb 7, 2026Updated 3 weeks ago
- A full-stack document management and AI chat application that enables users to upload, manage, and chat with their documents using AI. Bu…☆17Aug 10, 2025Updated 6 months ago
- This project contains the original white paper for Language Construct Modeling (LCM) v1.13, authored by Vincent Shing Hin Chong. It intro…☆15Jul 23, 2025Updated 7 months ago