pepijndevos / llama_multiserver
A proxy that hosts multiple single-model runners such as LLama.cpp and vLLM
☆12Updated last month
Alternatives and similar repositories for llama_multiserver:
Users that are interested in llama_multiserver are comparing it to the libraries listed below
- V.I.S.O.R., my in-development AI-powered voice assistant with integrated memory!☆32Updated last month
- Create text chunks which end at natural stopping points without using a tokenizer☆26Updated last month
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆29Updated this week
- Large Model Proxy is designed to make it easy to run multiple resource-heavy Large Models (LM) on the same machine with limited amount of…☆49Updated 3 months ago
- A simple light terminal style chat app that lets you use connect to your local llama.cpp server☆27Updated 7 months ago
- Demo of an "always-on" AI assistant.☆23Updated 11 months ago
- ☆43Updated 2 weeks ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆22Updated 7 months ago
- LlamaCards is a web application that provides a dynamic interface for interacting with LLM models in real-time. This app allows users to …☆37Updated 5 months ago
- ☆21Updated 5 months ago
- ☆25Updated last week
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆56Updated 5 months ago
- Who needs o1 anyways. Add CoT to any OpenAI compatible endpoint.☆41Updated 4 months ago
- Large-Language-Model to Machine Interface project.☆17Updated last year
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆17Updated 3 weeks ago
- Build HTML artefacts with Ollama☆11Updated last month
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆52Updated 3 months ago
- ☆16Updated last month
- ☆27Updated 3 months ago
- This benchmark tests how well LLMs incorporate a set of 10 mandatory story elements (characters, objects, core concepts, attributes, moti…☆35Updated last week
- Easily view and modify JSON datasets for large language models☆69Updated 3 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆31Updated 6 months ago
- Local LLM inference & management server with built-in OpenAI API☆31Updated 9 months ago
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools.☆15Updated 2 months ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆67Updated 4 months ago
- run ollama & gguf easily with a single command☆49Updated 8 months ago
- A Python-based chat application utilizing a Local LLM to generate complex thought chains for various use cases such as product developmen…☆17Updated 4 months ago
- An F/OSS solution combining AI with Wikipedia knowledge via a RAG pipeline☆26Updated 2 weeks ago
- Something similar to Apple Intelligence?☆58Updated 6 months ago
- SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks,…☆41Updated 3 months ago