Simple node proxy for llama-server that enables MCP use
☆19May 10, 2025Updated 11 months ago
Alternatives and similar repositories for llama-server_mcp_proxy
Users that are interested in llama-server_mcp_proxy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16May 8, 2025Updated 11 months ago
- ☆17Dec 16, 2024Updated last year
- Understand Any Repo in seconds. Instantly generate AI-ready code digests, visualize repository structures, and chat with your codebase us…☆27Jun 25, 2025Updated 9 months ago
- This project contains the original white paper for Language Construct Modeling (LCM) v1.13, authored by Vincent Shing Hin Chong. It intro…☆15Jul 23, 2025Updated 8 months ago
- ☆15Mar 18, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Calibrating LLM Confidence by Probing Perturbed Representation Stability☆18Jul 5, 2025Updated 9 months ago
- Manifold-Mixup implementation for fastai V2☆17Oct 1, 2020Updated 5 years ago
- a browser gui for nvidia smi☆21Mar 17, 2025Updated last year
- Autonomous, agentic, creative story writing system that incorporates stored embeddings and Knowledge Graphs.☆104Feb 16, 2026Updated last month
- Random llm scripts☆39Mar 28, 2026Updated 2 weeks ago
- ☆20Jul 4, 2025Updated 9 months ago
- Local RAG as a simple CLI, for standalone use or as a gptme tool☆49Mar 20, 2026Updated 3 weeks ago
- ☆24Aug 26, 2025Updated 7 months ago
- Offline LLM chatbot with personalized memory — works on CPU with multi-session memory support.☆22Jan 10, 2026Updated 3 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Llama.cpp runner/swapper and proxy that emulates LMStudio / Ollama backends☆57Aug 21, 2025Updated 7 months ago
- TLS & API keys for your LLM APIs☆20Dec 17, 2025Updated 3 months ago
- interactive semantic search demo using Qwen3-0.6B-Embedding in your browser☆58Feb 25, 2026Updated last month
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Feb 9, 2024Updated 2 years ago
- Vector functions and indexing for SQLite☆10Mar 26, 2023Updated 3 years ago
- Python language chat with Ollama models locally, anthropic and openai☆24Mar 5, 2026Updated last month
- Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!☆29Dec 11, 2025Updated 4 months ago
- Efforts toward giving Qwen 3 Coder 30B A3B proper agentic tool calling capabilities at or near 100% reliability.☆63Aug 10, 2025Updated 8 months ago
- Controllable Language Model Interactions in TypeScript☆10May 17, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- This script processes a grid image generated with the 4lph4bet family of LoRAs for Stable Diffusion 1.5 for font creation using Calligrap…☆38Jun 25, 2024Updated last year
- A repo for generating educational presentation videos.☆27May 13, 2025Updated 11 months ago
- An AI Vision Language Model System for extracting structured knowledge graph information(JSON) from images of process diagrams☆42Apr 5, 2025Updated last year
- An AI tool designed to generate explanations for every file in a project☆14Mar 7, 2025Updated last year
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :)☆17Mar 6, 2026Updated last month
- Analyze Reddit posts☆30Feb 27, 2025Updated last year
- Finally, an open source Youtube Summarizer extension☆77Apr 22, 2025Updated 11 months ago
- llama-swap + a minimal ollama compatible api☆56Mar 14, 2026Updated last month
- deadsimple immersive navigation: a single-player-verse component☆15Mar 11, 2026Updated last month
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- AdaLLM is an NVFP4-first inference runtime for Ada Lovelace (RTX 4090) with FP8 KV cache and custom decode kernels. This repo targets NVF…☆109Feb 15, 2026Updated last month
- Genertaes control vectors for use with llama.cpp in GGUF format.☆39Mar 19, 2025Updated last year
- Simple Tool Caller for llama.cpp☆11Aug 12, 2024Updated last year
- ☆24Mar 26, 2026Updated 2 weeks ago
- Python script to backup your VK profile☆28Sep 17, 2025Updated 6 months ago
- xml_to_json(xml, indent) function☆13Dec 13, 2021Updated 4 years ago
- “There is no such thing as a moral or an immoral book. Books are well written, or badly written.” I want to find all the well written con…☆20Nov 6, 2024Updated last year