sanctuary-systems-com/llama_multiserver

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sanctuary-systems-com/llama_multiserver)

sanctuary-systems-com / llama_multiserver

A proxy that hosts multiple single-model runners such as LLama.cpp and vLLM

☆13

Alternatives and similar repositories for llama_multiserver

Users that are interested in llama_multiserver are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hosseinhezami / totp-authenticator
View on GitHub
A PHP library for Time-based One-Time Password (TOTP) authentication
☆31Sep 5, 2025Updated 8 months ago
kooshi / llama-swappo
View on GitHub
llama-swap + a minimal ollama compatible api
☆59Mar 14, 2026Updated 2 months ago
perk11 / large-model-proxy
View on GitHub
Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…
☆90May 11, 2026Updated 2 weeks ago
kseyhan / llama-param-pal
View on GitHub
☆12May 30, 2025Updated 11 months ago
ortegaalfredo / crashbench
View on GitHub
Crashbench is a LLM benchmark to measure bug-finding and reporting capabilities of LLMs
☆14Mar 8, 2026Updated 2 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
wavify-labs / wavify-sdks
View on GitHub
fast state-of-the-art speech models and a runtime that runs anywhere 💥
☆57Feb 10, 2026Updated 3 months ago
form-asap / nice-learning
View on GitHub
Nice Learning is a completely free custom theme for Moodle 5.x. It’s clean, user-friendly, and fully compatible with right-to-left (RTL) …
☆20Apr 30, 2026Updated 3 weeks ago
cirbuk / plan-lint
View on GitHub
Static analysis toolkit for LLM agent plans
☆13Aug 9, 2025Updated 9 months ago
Mozilla-Ocho / formulaic-python
View on GitHub
The official Python library for Formulaic
☆18Apr 25, 2024Updated 2 years ago
davidsvaughn / prompt-loss-weight
View on GitHub
code for Towards Data Science article on prompt-loss-weight
☆11Jun 4, 2025Updated 11 months ago
abyesilyurt / minilm.c
View on GitHub
MiniLM (BERT) embeddings from scratch
☆20Aug 14, 2025Updated 9 months ago
monsieurgustav / NLTemplate
View on GitHub
Simple HTML template library for C++
☆14Feb 3, 2021Updated 5 years ago
Cvikli / DiffLib.jl
View on GitHub
Creating diff that supports wildcard produced by LLMs
☆16Sep 18, 2024Updated last year
ClarkuCSCI / pydiode
View on GitHub
Transfer data through a unidirectional network (i.e., a data diode)
☆13Apr 7, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
arduano / jit-demo
View on GitHub
☆14Dec 3, 2023Updated 2 years ago
AIAnytime / On-device-real-time-RAG-App
View on GitHub
On-device real-time RAG App built using Jina Reader, Mediapipe, Gemma 2b IT LLM.
☆15Apr 15, 2024Updated 2 years ago
lazy-guy / chess-llama
View on GitHub
Tiny Llama model trained to play chess
☆30Jul 22, 2025Updated 10 months ago
val-town / deno-http-worker
View on GitHub
Securely spawn Deno workers from Node.js
☆24May 12, 2026Updated last week
Sicanno / MILI
View on GitHub
☆10Dec 29, 2024Updated last year
av / klmbr
View on GitHub
klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs
☆89Sep 22, 2024Updated last year
yazon / flexllama
View on GitHub
🚀 FlexLLama - Lightweight self-hosted tool for running multiple llama.cpp server instances with OpenAI v1 API compatibility and multi-GP…
☆58Apr 27, 2026Updated 3 weeks ago
Lanerra / saga
View on GitHub
Autonomous, agentic, creative story writing system that incorporates stored embeddings and Knowledge Graphs.
☆108Feb 16, 2026Updated 3 months ago
arthurwolf / llmi
View on GitHub
Large-Language-Model to Machine Interface project.
☆19Dec 5, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
markbirss / rk3506-ubuntu
View on GitHub
Ubuntu 24.04.x OS image builder for various RK3506 SBC
☆30Apr 25, 2026Updated last month
devinambron / PyThoughtChain
View on GitHub
A Python-based chat application utilizing a Local LLM to generate complex thought chains for various use cases such as product developmen…
☆20Feb 18, 2026Updated 3 months ago
thomasmueller / xorfilter_cpp
View on GitHub
Bloom filter alternative (C++)
☆18Nov 8, 2018Updated 7 years ago
GitHub7667 / GitHub7667
View on GitHub
Config files for my GitHub profile.
☆21Mar 7, 2025Updated last year
iamPHEN / SPT-RoamingBotsd
View on GitHub
Mod that makes bots roam more in SPT
☆14May 27, 2025Updated 11 months ago
S4IL21 / Hill-Climb-Racing-Hacks
View on GitHub
S4IL's Open Sourced Hill Climb Racing Hack Menu made using Python
☆26Oct 29, 2025Updated 6 months ago
xenova / model-explorer
View on GitHub
Browse, search, and visualize ONNX models.
☆34May 6, 2025Updated last year
addmix / Godot-Aerodynamic-Tutorial
View on GitHub
Video: https://youtu.be/hpR1vvaQJaM
☆16May 3, 2026Updated 3 weeks ago
sgalkina / animations
View on GitHub
Manim animations for Youtube and teaching
☆10Mar 11, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Toy-97 / Chat-WebUI
View on GitHub
Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search …
☆52Feb 10, 2026Updated 3 months ago
Synoecium / Uniform-query-console-1C
View on GitHub
Минималистичная и функциональная консоль запросов, которая выглядит и работает одинаково, как в обычных формах, так и в управляемых.
☆13Feb 25, 2019Updated 7 years ago
0xAdafang / PersonAi
View on GitHub
PersonAi is a local-first desktop app that lets you create and chat with AI-powered characters. Built with Tauri, React, Rust, Go, and Py…
☆28Aug 25, 2025Updated 8 months ago
tsari / vpn-proxy
View on GitHub
Docker container with squid proxy and openvpn client
☆15Mar 25, 2018Updated 8 years ago
PacktPublishing / Building-Smart-Chatbots-with-LangChain
View on GitHub
☆14Dec 15, 2025Updated 5 months ago
Stivo182 / curl-builder
View on GitHub
CURL builder - Графический конструктор командной строки для 1С:Предприятие 8
☆14Jul 22, 2024Updated last year
PasiKoodaa / ACE-Step-RADIO
View on GitHub
ACE-Step: A Step Towards Music Generation Foundation Model
☆50May 20, 2025Updated last year