A proxy that hosts multiple single-model runners such as LLama.cpp and vLLM
☆12May 30, 2025Updated last year
Alternatives and similar repositories for llama_multiserver
Users that are interested in llama_multiserver are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A PHP library for Time-based One-Time Password (TOTP) authentication☆30Sep 5, 2025Updated 9 months ago
- llama-swap + a minimal ollama compatible api☆61May 26, 2026Updated last month
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆89Updated this week
- ☆12May 30, 2025Updated last year
- Crashbench is a LLM benchmark to measure bug-finding and reporting capabilities of LLMs☆14Mar 8, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- fast state-of-the-art speech models and a runtime that runs anywhere 💥☆58Feb 10, 2026Updated 4 months ago
- Nice Learning is a completely free custom theme for Moodle 5.x. It’s clean, user-friendly, and fully compatible with right-to-left (RTL) …☆20May 22, 2026Updated last month
- 🌳 MCTS-inspired parallel beam search for conversation optimization. Explore multiple dialogue strategies simultaneously, stress-test a…☆36Jan 18, 2026Updated 5 months ago
- Static analysis toolkit for LLM agent plans☆13Aug 9, 2025Updated 10 months ago
- Qt and QML based Close Combat-like game.☆16Aug 3, 2013Updated 12 years ago
- The official Python library for Formulaic☆18Apr 25, 2024Updated 2 years ago
- Creating diff that supports wildcard produced by LLMs☆16Sep 18, 2024Updated last year
- Testing various libraries/approaches for compressing floating point data☆15Apr 18, 2023Updated 3 years ago
- Transfer data through a unidirectional network (i.e., a data diode)☆13Apr 7, 2026Updated 2 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- On-device real-time RAG App built using Jina Reader, Mediapipe, Gemma 2b IT LLM.☆15Apr 15, 2024Updated 2 years ago
- Tiny Llama model trained to play chess☆31Jul 22, 2025Updated 11 months ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆89Sep 22, 2024Updated last year
- 🚀 FlexLLama - Lightweight self-hosted tool for running multiple llama.cpp server instances with OpenAI v1 API compatibility and multi-GP…☆59Jun 10, 2026Updated 3 weeks ago
- Autonomous, agentic, creative story writing system that incorporates stored embeddings and Knowledge Graphs.☆107Feb 16, 2026Updated 4 months ago
- Large-Language-Model to Machine Interface project.☆19Dec 5, 2023Updated 2 years ago
- A Python-based chat application utilizing a Local LLM to generate complex thought chains for various use cases such as product developmen…☆20Feb 18, 2026Updated 4 months ago
- Config files for my GitHub profile.☆55Jun 16, 2026Updated 2 weeks ago
- Mod that makes bots roam more in SPT☆14May 27, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- S4IL's Open Sourced Hill Climb Racing Hack Menu made using Python☆32Oct 29, 2025Updated 8 months ago
- Manim animations for Youtube and teaching☆10Mar 11, 2025Updated last year
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search …☆52Feb 10, 2026Updated 4 months ago
- Just A Rather Very Intelligent System (J.A.R.V.I.S). Built by members of Cogito NTNU. Includes core extensions and functionality. Extra f…☆15Apr 30, 2025Updated last year
- ✈️ Бесплатные серверы Shadowsocks ✈️ ✈️ Бесплатные узлы ✈️ ✈️ Шаринг серверов – полностью бесплатно. ✈️ Лично проверено! Эти узлы досту…☆19Apr 28, 2025Updated last year
- PersonAi is a local-first desktop app that lets you create and chat with AI-powered characters. Built with Tauri, React, Rust, Go, and Py…☆30Aug 25, 2025Updated 10 months ago
- Docker container with squid proxy and openvpn client☆15Mar 25, 2018Updated 8 years ago
- Venus is a neural network aim assist that uses real-time object detection accelerated with CUDA on Nvidia GPUs.☆11Jul 25, 2024Updated last year
- CURL builder - Графический конструктор командной строки для 1С:Предприятие 8☆14Jul 22, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- AI management tool☆119Nov 9, 2024Updated last year
- A full-stack Instagram clone built with React.js for the frontend, Spring Boot for the backend, and MySQL as the database. Includes user …☆23Jan 2, 2026Updated 6 months ago
- ACE-Step: A Step Towards Music Generation Foundation Model☆50May 20, 2025Updated last year
- A personal collection/modlist/catalogue for (subjectively) best SPT mods.☆19Feb 10, 2025Updated last year
- An extension for SwarmUI that allows you to connect to Ollama, OpenAI, and OpenRouter to use vision models for image analysis to create i…☆30May 31, 2026Updated last month
- Using multiple LLMs for ensemble Forecasting☆16Jan 17, 2024Updated 2 years ago
- Basic Chatbot Code in Python Greeting Responses: "Hello," "Hi," etc. Basic Questions: Like "How are you?" or "What's your name?" Exit Con…☆15Feb 7, 2026Updated 4 months ago