mostlygeek / llama-swap
Model swapping for llama.cpp (or any local OpenAI-compatible server)
☆506 Updated last week
Alternatives and similar repositories for llama-swap:
Users who are interested in llama-swap are comparing it to the libraries listed below:
- What If Language Models Expertly Routed All Inference? WilmerAI allows prompts to be routed to specialized workflows based on the domain … ☆640 Updated last week
- Manifold is a platform for enabling workflow automation using AI assistants. ☆364 Updated this week
- An OAI compatible exllamav2 API that's both lightweight and fast ☆901 Updated 3 weeks ago
- The Fastest Way to Fine-Tune LLMs Locally ☆290 Updated 3 weeks ago
- An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs ☆226 Updated this week
- ☆167 Updated 3 weeks ago
- Lightweight Inference server for OpenVINO ☆152 Updated this week
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal. ☆248 Updated last month
- Local LLM Powered Recursive Search & Smart Knowledge Explorer ☆233 Updated 2 months ago
- A proxy server for multiple ollama instances with Key security ☆382 Updated last week
- Stateful load balancer custom-tailored for llama.cpp 🏓🦙 ☆737 Updated last week
- Large-scale LLM inference engine ☆1,379 Updated this week
- Open source LLM UI, compatible with all local LLM providers. ☆173 Updated 6 months ago
- ☆197 Updated this week
- Code execution utilities for Open WebUI & Ollama ☆269 Updated 5 months ago
- Web UI for ExLlamaV2 ☆491 Updated 2 months ago
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data. ☆352 Updated 2 weeks ago
- A multi-platform desktop application to evaluate and compare LLM models, written in Rust and React. ☆695 Updated 3 months ago
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses. ☆577 Updated 5 months ago
- A multimodal, function calling powered LLM webui. ☆214 Updated 6 months ago
- A tool to determine whether or not your PC can run a given LLM ☆165 Updated 2 months ago
- VS Code extension for LLM-assisted code/text completion ☆656 Updated 3 weeks ago
- Effortlessly run LLM backends, APIs, frontends, and services with one command. ☆1,545 Updated this week
- Chat with your current directory's files using a local or API LLM. ☆344 Updated last month
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2. ☆148 Updated 11 months ago
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible ☆309 Updated last month
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM … ☆551 Updated last month
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...) ☆238 Updated last week
- Efficient visual programming for AI language models ☆354 Updated 7 months ago
- Notate is a desktop chat application that takes AI conversations to the next level. It combines the simplicity of chat with advanced feat… ☆249 Updated last month