turboderp-org / exui
Web UI for ExLlamaV2
☆487Updated last month
Alternatives and similar repositories for exui:
Users that are interested in exui are comparing it to the libraries listed below
- An OAI compatible exllamav2 API that's both lightweight and fast☆863Updated this week
- A multimodal, function calling powered LLM webui.☆215Updated 6 months ago
- LLM Frontend in a single html file☆411Updated 2 months ago
- A fast batching API to serve LLM models☆182Updated 10 months ago
- An AI assistant beyond the chat box.☆322Updated last year
- Memoir+ a persona memory extension for Text Gen Web UI.☆193Updated last week
- Large-scale LLM inference engine☆1,342Updated this week
- An Autonomous LLM Agent that runs on Wizcoder-15B☆336Updated 5 months ago
- Clipboard Conqueror is a novel copy and paste copilot alternative designed to bring your very own LLM AI assistant to any text field.☆396Updated 2 months ago
- Dolphin System Messages☆272Updated last month
- TheBloke's Dockerfiles☆306Updated last year
- ☆273Updated last month
- Low-Rank adapter extraction for fine-tuned transformers models☆171Updated 10 months ago
- Easily view and modify JSON datasets for large language models☆71Updated 2 weeks ago
- Falcon LLM ggml framework with CPU and GPU support☆246Updated last year
- ☆196Updated last week
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆148Updated 10 months ago
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible☆305Updated 3 weeks ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆232Updated 2 weeks ago
- An extension for oobabooga/text-generation-webui that enables the LLM to search the web using DuckDuckGo☆231Updated this week
- AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models☆152Updated 10 months ago
- This is our own implementation of 'Layer Selective Rank Reduction'☆233Updated 9 months ago
- Efficient visual programming for AI language models☆351Updated 6 months ago
- function calling-based LLM agents☆284Updated 6 months ago
- automatically quant GGUF models☆161Updated this week
- transparent proxy server for llama.cpp's server to provide automatic model swapping☆460Updated this week
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆123Updated last year
- Experimental LLM Inference UX to aid in creative writing☆113Updated 3 months ago
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆296Updated last week
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆545Updated last month