kooshi / llama-swappoLinks
llama-swap + a minimal ollama compatible api
☆38Updated this week
Alternatives and similar repositories for llama-swappo
Users that are interested in llama-swappo are comparing it to the libraries listed below
Sorting:
- ☆87Updated 3 weeks ago
- ☆50Updated 2 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆86Updated this week
- "Pacha" TUI (Text User Interface) is a JavaScript application that utilizes the "blessed" library. It serves as a frontend for llama.cpp …☆36Updated 2 years ago
- Llama.cpp runner/swapper and proxy that emulates LMStudio / Ollama backends☆48Updated 4 months ago
- Generate Your Own Private Morning Radio for Commute☆33Updated 10 months ago
- Autonomous, agentic, creative story writing system that incorporates stored embeddings and Knowledge Graphs.☆89Updated last week
- A local front-end for open-weight LLMs with memory, RAG, TTS/STT, Elo ratings, and dynamic research tools. Built with React and FastAPI.☆40Updated 4 months ago
- ☆35Updated last year
- Eternal is an experimental platform for machine learning models and workflows.☆68Updated 9 months ago
- GPU Power and Performance Manager☆64Updated last year
- A proxy that hosts multiple single-model runners such as LLama.cpp and vLLM☆12Updated 7 months ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆165Updated last year
- Prometheus exporter for Linux based GDDR6/GDDR6X VRAM and GPU Core Hot spot temperature reader for NVIDIA RTX 3000/4000 series GPUs.☆24Updated last year
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search …☆47Updated 4 months ago
- ☆83Updated 10 months ago
- Aggregates compute from spare GPU capacity☆183Updated last week
- A simple Gradio WebUI for loading/unloading models and loras in tabbyAPI.☆20Updated last year
- A frontend for creative writing with LLMs☆144Updated last year
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text☆79Updated 3 weeks ago
- A simple tool to anonymize LLM prompts.☆65Updated 11 months ago
- Privacy-first agentic framework with powerful reasoning & task automation capabilities. Natively distributed and fully ISO 27XXX complian…☆68Updated 9 months ago
- German "Who Wants To Be A Millionaire" LLM Benchmarking.☆47Updated last week
- ☆51Updated 10 months ago
- ☆57Updated last year
- Writing Extension for Text Generation WebUI☆64Updated 4 months ago
- Guide on text completion large language model fine-tuning, including example scripts and training data acquiring.☆86Updated 10 months ago
- InferX: Inference as a Service Platform☆143Updated last week
- No-messing-around sh client for llama.cpp's server☆30Updated last year
- Produce your own Dynamic 3.0 Quants and achieve optimum accuracy & SOTA quantization performance! Input your VRAM and RAM and the toolcha…☆75Updated this week