mostlygeek / llama-swap
Model swapping for llama.cpp (or any local OpenAPI compatible server)
☆709Updated this week
Alternatives and similar repositories for llama-swap:
Users that are interested in llama-swap are comparing it to the libraries listed below
- What If Language Models Expertly Routed All Inference? WilmerAI allows prompts to be routed to specialized workflows based on the domain …☆663Updated last month
- An OAI compatible exllamav2 API that's both lightweight and fast☆940Updated this week
- llama.cpp fork with additional SOTA quants and improved performance☆400Updated this week
- Manifold is a platform for enabling workflow automation using AI assistants.☆378Updated this week
- An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs☆341Updated this week
- ☆184Updated this week
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible☆311Updated 2 months ago
- Stateful load balancer custom-tailored for llama.cpp 🏓🦙☆747Updated last week
- Easy to use interface for the Whisper model optimized for all GPUs!☆191Updated last week
- Lightweight Inference server for OpenVINO☆163Updated last week
- Effortlessly run LLM backends, APIs, frontends, and services with one command.☆1,645Updated last week
- ☆201Updated 2 weeks ago
- Local LLM Powered Recursive Search & Smart Knowledge Explorer☆240Updated 2 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆251Updated 2 months ago
- The Fastest Way to Fine-Tune LLMs Locally☆293Updated last month
- Open source LLM UI, compatible with all local LLM providers.☆174Updated 7 months ago
- VS Code extension for LLM-assisted code/text completion☆692Updated 3 weeks ago
- Clara – Privacy-first, client-side AI assistant for Ollama with tool calling & mini n8n-style flow builder. No backend. No data leaks. 10…☆717Updated this week
- Notate is a desktop chat application that takes AI conversations to the next level. It combines the simplicity of chat with advanced feat…☆251Updated 2 months ago
- Large-scale LLM inference engine☆1,405Updated last week
- A local AI companion that uses a collection of free, open source AI models in order to create two virtual companions that will follow you…☆209Updated 3 weeks ago
- This project demonstrates a basic chain-of-thought interaction with any LLM (Large Language Model)☆318Updated 7 months ago
- LLM Frontend in a single html file☆457Updated 3 months ago
- Web UI for ExLlamaV2☆493Updated 3 months ago
- A multi-platform desktop application to evaluate and compare LLM models, written in Rust and React.☆726Updated 3 weeks ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆558Updated 2 months ago
- OpenAPI Tool Servers☆347Updated last week
- Efficient visual programming for AI language models☆359Updated 7 months ago
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆582Updated 6 months ago
- Big & Small LLMs working together☆733Updated this week