turboderp / exui
Web UI for ExLlamaV2
☆445 Updated last month
Related projects
Alternatives and complementary repositories for exui
- An OAI compatible exllamav2 API that's both lightweight and fast ☆605 Updated this week
- A multimodal, function calling powered LLM webui. ☆208 Updated last month
- Large-scale LLM inference engine ☆1,134 Updated this week
- LLM Frontend in a single html file ☆259 Updated 2 weeks ago
- A fast batching API to serve LLM models ☆172 Updated 6 months ago
- An AI assistant beyond the chat box. ☆315 Updated 8 months ago
- function calling-based LLM agents ☆278 Updated 2 months ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2. ☆126 Updated 6 months ago
- Y'all thought the dead internet theory wasn't real, but HERE IT IS ☆171 Updated 6 months ago
- Memoir+ a persona extension for Text Gen Web UI. That includes memory, emotions, command handling and more. ☆171 Updated last month
- Efficient visual programming for AI language models ☆299 Updated 2 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal. ☆202 Updated last month
- An extension for oobabooga/text-generation-webui that enables the LLM to search the web using DuckDuckGo ☆173 Updated this week
- This is our own implementation of 'Layer Selective Rank Reduction' ☆232 Updated 5 months ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆124 Updated last year
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM … ☆493 Updated 3 months ago
- ☆128 Updated this week
- Open source LLM UI, compatible with all local LLM providers. ☆167 Updated 2 months ago
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM. ☆244 Updated 2 weeks ago
- Falcon LLM ggml framework with CPU and GPU support ☆244 Updated 9 months ago
- TheBloke's Dockerfiles ☆299 Updated 8 months ago
- AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models ☆139 Updated 6 months ago
- Dolphin System Messages ☆202 Updated 2 months ago
- A python application that routes incoming prompts to an LLM by category, and can support a single incoming connection from a front end to… ☆167 Updated this week
- ☆227 Updated last month
- Easily view and modify JSON datasets for large language models ☆62 Updated last month
- Low-Rank adapter extraction for fine-tuned transformers model ☆162 Updated 6 months ago
- AI management tool ☆107 Updated last week
- Text WebUI extension to add clever Notebooks to Chat mode ☆133 Updated 10 months ago
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation … ☆162 Updated 4 months ago