mostlygeek / llama-swap
transparent proxy server for llama.cpp's server to provide automatic model swapping
☆460Updated this week
Alternatives and similar repositories for llama-swap:
Users that are interested in llama-swap are comparing it to the libraries listed below
- What If Language Models Expertly Routed All Inference? WilmerAI allows prompts to be routed to specialized workflows based on the domain …☆621Updated last week
- An OAI compatible exllamav2 API that's both lightweight and fast☆863Updated this week
- Open source LLM UI, compatible with all local LLM providers.☆173Updated 6 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆237Updated 2 weeks ago
- ☆196Updated last week
- Web UI for ExLlamaV2☆487Updated last month
- Efficient visual programming for AI language models☆351Updated 6 months ago
- Code execution utilities for Open WebUI & Ollama☆262Updated 4 months ago
- A multimodal, function calling powered LLM webui.☆215Updated 6 months ago
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible☆305Updated 3 weeks ago
- This project demonstrates a basic chain-of-thought interaction with any LLM (Large Language Model)☆318Updated 6 months ago
- Effortlessly run LLM backends, APIs, frontends, and services with one command.☆1,483Updated this week
- Dynamically structure language models to produce outputs that adhere to specific requirements without sacrificing their creative capabili…☆119Updated this week
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆566Updated 4 months ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆148Updated 10 months ago
- A fast batching API to serve LLM models☆182Updated 10 months ago
- Multi-modal modular data ingestion and retrieval☆460Updated this week
- A Python-based web-assisted large language model (LLM) search assistant using Llama.cpp☆344Updated 4 months ago
- Code for Papeg.ai☆221Updated 2 months ago
- You don’t need to read the code to understand how to build!☆184Updated 2 months ago
- A multi-platform desktop application to evaluate and compare LLM models, written in Rust and React.☆672Updated 2 months ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆547Updated last month
- Notate is a desktop chat application that takes AI conversations to the next level. It combines the simplicity of chat with advanced feat…☆245Updated 3 weeks ago