kooshi / llama-swappo
llama-swap + a minimal Ollama-compatible API
☆20Updated this week
Alternatives and similar repositories for llama-swappo
Users who are interested in llama-swappo are comparing it to the libraries listed below.
- ☆35Updated last month
- Llama.cpp runner/swapper and proxy that emulates LMStudio / Ollama backends☆40Updated this week
- A library and CLI utilities for managing performance states of NVIDIA GPUs.☆27Updated 10 months ago
- Prometheus exporter for Linux based GDDR6/GDDR6X VRAM and GPU Core Hot spot temperature reader for NVIDIA RTX 3000/4000 series GPUs.☆21Updated 10 months ago
- A simple tool to anonymize LLM prompts.☆64Updated 6 months ago
- Autonomous, agentic, creative story writing system that incorporates stored embeddings and Knowledge Graphs.☆72Updated 2 weeks ago
- Simple node proxy for llama-server that enables MCP use☆13Updated 3 months ago
- ☆81Updated this week
- A simple Gradio WebUI for loading/unloading models and LoRAs in tabbyAPI.☆22Updated 8 months ago
- ☆19Updated 10 months ago
- A proxy that hosts multiple single-model runners such as llama.cpp and vLLM☆11Updated 2 months ago
- Eternal is an experimental platform for machine learning models and workflows.☆68Updated 5 months ago
- Lightweight Inference server for OpenVINO☆193Updated this week
- A GTK4-based text-to-speech and AI assistant app in Rust, featuring PDF reading and LLM chat powered by Kokoro TTS☆16Updated last month
- Teaching AI to play the classic text adventure Zork using Large Language Models☆23Updated 2 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆67Updated last month
- Core, Junction, and VRAM temperature reader for Linux + GDDR6/GDDR6X GPUs☆47Updated 2 months ago
- A local front-end for open-weight LLMs with memory, RAG, TTS/STT, Elo ratings, and dynamic research tools. Built with React and FastAPI.☆30Updated this week
- Bookmarklet to pull and run Hugging Face GGUF models in Ollama☆14Updated 9 months ago
- KoboldCpp Smart Launcher with GPU Layer and Tensor Override Tuning☆26Updated 2 months ago
- Generate Your Own Private Morning Radio for Commute☆33Updated 6 months ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆158Updated last year
- An Open WebUI function for a better R1 experience☆79Updated 5 months ago
- Polyglot is a fast, elegant, and free translation tool using AI.☆63Updated 11 months ago
- "Pacha" TUI (Text User Interface) is a JavaScript application that utilizes the "blessed" library. It serves as a frontend for llama.cpp …☆36Updated 2 years ago
- AI-powered chatbot with real-time updates.☆60Updated 9 months ago
- Chat with your PDF using your local LLM, Ollama client. (incomplete)☆37Updated 9 months ago
- ☆50Updated 5 months ago
- Add web search results to LLM prompts.☆31Updated last month
- LocalAI integration component for Home Assistant☆43Updated 2 years ago