Simple node proxy for llama-server that enables MCP use
☆19May 10, 2025Updated 10 months ago
Alternatives and similar repositories for llama-server_mcp_proxy
Users that are interested in llama-server_mcp_proxy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16May 8, 2025Updated 10 months ago
- ☆17Dec 16, 2024Updated last year
- Understand Any Repo in seconds. Instantly generate AI-ready code digests, visualize repository structures, and chat with your codebase us…☆26Jun 25, 2025Updated 8 months ago
- This project contains the original white paper for Language Construct Modeling (LCM) v1.13, authored by Vincent Shing Hin Chong. It intro…☆15Jul 23, 2025Updated 8 months ago
- One stop shop - Local-first RAG stack with intelligent polyglot-code/docs, remote code execution, local llama enrichment, progressive dis…☆31Feb 17, 2026Updated last month
- ☆15Mar 18, 2026Updated last week
- Calibrating LLM Confidence by Probing Perturbed Representation Stability☆17Jul 5, 2025Updated 8 months ago
- a browser gui for nvidia smi☆20Mar 17, 2025Updated last year
- Autonomous, agentic, creative story writing system that incorporates stored embeddings and Knowledge Graphs.☆100Feb 16, 2026Updated last month
- Random llm scripts☆37Updated this week
- ☆19Jul 4, 2025Updated 8 months ago
- Local RAG as a simple CLI, for standalone use or as a gptme tool☆49Updated this week
- ☆24Aug 26, 2025Updated 6 months ago
- A lightweight LLaMA.cpp HTTP server Docker image based on Alpine Linux.☆32Oct 3, 2025Updated 5 months ago
- Offline LLM chatbot with personalized memory — works on CPU with multi-session memory support.☆22Jan 10, 2026Updated 2 months ago
- Llama.cpp runner/swapper and proxy that emulates LMStudio / Ollama backends☆55Aug 21, 2025Updated 7 months ago
- TLS & API keys for your LLM APIs☆20Dec 17, 2025Updated 3 months ago
- interactive semantic search demo using Qwen3-0.6B-Embedding in your browser☆57Feb 25, 2026Updated 3 weeks ago
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Feb 9, 2024Updated 2 years ago
- The most feature-complete local AI workstation. Multi-GPU inference, integrated Stable Diffusion + ADetailer, voice cloning, research-gra…☆57Feb 24, 2026Updated last month
- Vector functions and indexing for SQLite☆10Mar 26, 2023Updated 2 years ago
- ☆93Dec 9, 2025Updated 3 months ago
- Python language chat with Ollama models locally, anthropic and openai☆24Mar 5, 2026Updated 2 weeks ago
- Prometheus exporter for Linux based GDDR6/GDDR6X VRAM and GPU Core Hot spot temperature reader for NVIDIA RTX 3000/4000 series GPUs.☆24Oct 2, 2024Updated last year
- Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!☆29Dec 11, 2025Updated 3 months ago
- Efforts toward giving Qwen 3 Coder 30B A3B proper agentic tool calling capabilities at or near 100% reliability.☆64Aug 10, 2025Updated 7 months ago
- Controllable Language Model Interactions in TypeScript☆10May 17, 2024Updated last year
- This script processes a grid image generated with the 4lph4bet family of LoRAs for Stable Diffusion 1.5 for font creation using Calligrap…☆38Jun 25, 2024Updated last year
- A repo for generating educational presentation videos.☆27May 13, 2025Updated 10 months ago
- An AI Vision Language Model System for extracting structured knowledge graph information(JSON) from images of process diagrams☆41Apr 5, 2025Updated 11 months ago
- An AI tool designed to generate explanations for every file in a project☆14Mar 7, 2025Updated last year
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :)☆17Mar 6, 2026Updated 2 weeks ago
- llama-swap + a minimal ollama compatible api☆54Mar 14, 2026Updated last week
- Analyze Reddit posts☆30Feb 27, 2025Updated last year
- Finally, an open source Youtube Summarizer extension☆77Apr 22, 2025Updated 11 months ago
- deadsimple immersive navigation: a single-player-verse component☆15Mar 11, 2026Updated last week
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated last year
- A blueprint for next-gen AI. Project Infinity uses a token-efficient, Codified Agent Protocol to create specialized, secure, and imaginat…☆26Mar 13, 2026Updated last week
- Genertaes control vectors for use with llama.cpp in GGUF format.☆39Mar 19, 2025Updated last year