kooshi / llama-swappoLinks
llama-swap + a minimal ollama compatible api
☆44Updated 2 weeks ago
Alternatives and similar repositories for llama-swappo
Users that are interested in llama-swappo are comparing it to the libraries listed below
Sorting:
- ☆89Updated last month
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆88Updated last week
- No-messing-around sh client for llama.cpp's server☆30Updated last year
- ☆50Updated 3 months ago
- "Pacha" TUI (Text User Interface) is a JavaScript application that utilizes the "blessed" library. It serves as a frontend for llama.cpp …☆36Updated 2 years ago
- GPU Power and Performance Manager☆66Updated last year
- Generate Your Own Private Morning Radio for Commute☆32Updated 11 months ago
- A simple tool to anonymize LLM prompts.☆66Updated last year
- Llama.cpp runner/swapper and proxy that emulates LMStudio / Ollama backends☆50Updated 5 months ago
- A simple Gradio WebUI for loading/unloading models and loras in tabbyAPI.☆20Updated last year
- A proxy that hosts multiple single-model runners such as LLama.cpp and vLLM☆12Updated 7 months ago
- A utility that uses Whisper to transcribe videos and various translation APIs to translate the transcribed text and save them as SRT (sub…☆74Updated last year
- Autonomous, agentic, creative story writing system that incorporates stored embeddings and Knowledge Graphs.☆92Updated this week
- ☆20Updated last year
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search …☆47Updated 4 months ago
- Prometheus exporter for Linux based GDDR6/GDDR6X VRAM and GPU Core Hot spot temperature reader for NVIDIA RTX 3000/4000 series GPUs.☆24Updated last year
- Aggregates compute from spare GPU capacity☆187Updated last week
- Generate a llama-quantize command to copy the quantization parameters of any GGUF☆28Updated this week
- Eternal is an experimental platform for machine learning models and workflows.☆68Updated 10 months ago
- A tool to determine whether or not your PC can run a given LLM☆167Updated 11 months ago
- V.I.S.O.R., my in-development AI-powered voice assistant with integrated memory!☆36Updated 2 months ago
- ☆35Updated last year
- A library and CLI utilities for managing performance states of NVIDIA GPUs.☆33Updated last year
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆26Updated 9 months ago
- Code execution utilities for Open WebUI & Ollama☆317Updated last year
- A daemon that automatically manages the performance states of NVIDIA GPUs.☆109Updated 2 months ago
- ☆36Updated 5 months ago
- reddacted lets you analyze & sanitize your online footprint using LLMs, PII detection & sentiment analysis to identify anything that migh…☆116Updated 6 months ago
- A open webui function for better R1 experience☆78Updated 10 months ago
- Create text chunks which end at natural stopping points without using a tokenizer☆26Updated 2 months ago