A proxy that hosts multiple single-model runners such as LLama.cpp and vLLM
☆13May 30, 2025Updated last year
Alternatives and similar repositories for llama_multiserver
Users that are interested in llama_multiserver are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- llama-swap + a minimal ollama compatible api☆60May 26, 2026Updated 2 weeks ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆90Jun 6, 2026Updated last week
- fast state-of-the-art speech models and a runtime that runs anywhere 💥☆57Feb 10, 2026Updated 4 months ago
- Nice Learning is a completely free custom theme for Moodle 5.x. It’s clean, user-friendly, and fully compatible with right-to-left (RTL) …☆20May 22, 2026Updated 3 weeks ago
- Static analysis toolkit for LLM agent plans☆13Aug 9, 2025Updated 10 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- The official Python library for Formulaic☆18Apr 25, 2024Updated 2 years ago
- MiniLM (BERT) embeddings from scratch☆20Aug 14, 2025Updated 10 months ago
- Simple HTML template library for C++☆14Feb 3, 2021Updated 5 years ago
- Testing various libraries/approaches for compressing floating point data☆15Apr 18, 2023Updated 3 years ago
- ☆14Dec 3, 2023Updated 2 years ago
- Transfer data through a unidirectional network (i.e., a data diode)☆13Apr 7, 2026Updated 2 months ago
- On-device real-time RAG App built using Jina Reader, Mediapipe, Gemma 2b IT LLM.☆15Apr 15, 2024Updated 2 years ago
- Tiny Llama model trained to play chess☆30Jul 22, 2025Updated 10 months ago
- ☆10Dec 29, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆89Sep 22, 2024Updated last year
- 🚀 FlexLLama - Lightweight self-hosted tool for running multiple llama.cpp server instances with OpenAI v1 API compatibility and multi-GP…☆58Apr 27, 2026Updated last month
- Autonomous, agentic, creative story writing system that incorporates stored embeddings and Knowledge Graphs.☆108Feb 16, 2026Updated 3 months ago
- A Python-based chat application utilizing a Local LLM to generate complex thought chains for various use cases such as product developmen…☆20Feb 18, 2026Updated 3 months ago
- This repository contains the code and other resources used in OpenAI GPT for Python Developers (2nd Edition)☆12Apr 23, 2024Updated 2 years ago
- Mod that makes bots roam more in SPT☆14May 27, 2025Updated last year
- S4IL's Open Sourced Hill Climb Racing Hack Menu made using Python☆34Oct 29, 2025Updated 7 months ago
- Browse, search, and visualize ONNX models.☆35May 6, 2025Updated last year
- Writingway v2.0 - Writingway, but rebuild in Java/HTML instead of Python, with much improved UI. Link to our Discord server: https://disc…☆67May 16, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Video: https://youtu.be/hpR1vvaQJaM☆16May 3, 2026Updated last month
- Manim animations for Youtube and teaching☆10Mar 11, 2025Updated last year
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search …☆52Feb 10, 2026Updated 4 months ago
- Just A Rather Very Intelligent System (J.A.R.V.I.S). Built by members of Cogito NTNU. Includes core extensions and functionality. Extra f…☆15Apr 30, 2025Updated last year
- PersonAi is a local-first desktop app that lets you create and chat with AI-powered characters. Built with Tauri, React, Rust, Go, and Py…☆30Aug 25, 2025Updated 9 months ago
- ☆23May 14, 2026Updated last month
- AI management tool☆119Nov 9, 2024Updated last year
- ACE-Step: A Step Towards Music Generation Foundation Model☆50May 20, 2025Updated last year
- An extension for SwarmUI that allows you to connect to Ollama, OpenAI, and OpenRouter to use vision models for image analysis to create i…☆30May 31, 2026Updated 2 weeks ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- CLI utility that extracting signatures via searching by hash from any file(esp. binary)☆14May 14, 2025Updated last year
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆59Dec 1, 2024Updated last year
- Teaching AI to play the classic text adventure Zork using Large Language Models☆37Apr 5, 2026Updated 2 months ago
- Identifying and distinguishing spam SMS and Email using the multinomial Naïve Bayes model.☆17Jun 1, 2025Updated last year
- ☆31Nov 5, 2024Updated last year
- The program consists of automated tests written in Java using Selenium WebDriver to test key functionalities in the e-learning system.☆13Jun 8, 2025Updated last year
- 🎮 Material You TUI for monitoring NVIDIA GPUs☆58Jan 16, 2026Updated 4 months ago