mostlygeek / llama-swap
transparent proxy server for llama.cpp's server to provide automatic model swapping
☆187Updated this week
Alternatives and similar repositories for llama-swap:
Users that are interested in llama-swap are comparing it to the libraries listed below
- Open source LLM UI, compatible with all local LLM providers.☆171Updated 5 months ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆68Updated 4 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆53Updated this week
- ☆192Updated 3 weeks ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆146Updated 9 months ago
- A frontend for creative writing with LLMs☆117Updated 7 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆225Updated 2 months ago
- Easily view and modify JSON datasets for large language models☆71Updated last week
- A fast batching API to serve LLM models☆180Updated 9 months ago
- AI management tool☆112Updated 3 months ago
- What If Language Models Expertly Routed All Inference? WilmerAI allows prompts to be routed to specialized workflows based on the domain …☆513Updated last week
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text☆61Updated 3 months ago
- ☆77Updated last month
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible☆297Updated this week
- Eternal is an experimental platform for machine learning models and workflows.☆68Updated 6 months ago
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro…☆64Updated 3 months ago
- Code execution utilities for Open WebUI & Ollama☆249Updated 3 months ago
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆173Updated 7 months ago
- Efficient visual programming for AI language models☆345Updated 5 months ago
- Turns devices into a scalable LLM platform☆117Updated this week
- ☆124Updated 2 weeks ago
- AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models☆150Updated 9 months ago
- ☆268Updated 3 weeks ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆111Updated 3 months ago
- An OAI compatible exllamav2 API that's both lightweight and fast☆795Updated this week
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …☆76Updated 3 weeks ago