theroyallab / tabbyAPI
An OAI compatible exllamav2 API that's both lightweight and fast
☆605Updated this week
Related projects ⓘ
Alternatives and complementary repositories for tabbyAPI
- Web UI for ExLlamaV2☆445Updated last month
- Large-scale LLM inference engine☆1,134Updated this week
- A multimodal, function calling powered LLM webui.☆208Updated last month
- LLM Frontend in a single html file☆259Updated 2 weeks ago
- A fast batching API to serve LLM models☆172Updated 6 months ago
- Memoir+ a persona extension for Text Gen Web UI. That includes memory, emotions, command handling and more.☆171Updated last month
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆202Updated last month
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible☆263Updated 2 months ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆126Updated 6 months ago
- An extension for oobabooga/text-generation-webui that enables the LLM to search the web using DuckDuckGo☆173Updated this week
- An AI assistant beyond the chat box.☆315Updated 8 months ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆493Updated 3 months ago
- Efficient visual programming for AI language models☆299Updated 2 months ago
- Simple go utility to download HuggingFace Models and Datasets☆509Updated 3 weeks ago
- Effortlessly run LLM backends, APIs, frontends, and services with one command.☆531Updated this week
- An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.☆476Updated 2 months ago
- Convert Compute And Books Into Instruct-Tuning Datasets! Makes: QA, RP, Classifiers.☆1,033Updated last week
- Open source LLM UI, compatible with all local LLM providers.☆167Updated 2 months ago
- A python application that routes incoming prompts to an LLM by category, and can support a single incoming connection from a front end to…☆167Updated this week
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆244Updated 2 weeks ago
- Y'all thought the dead internet theory wasn't real, but HERE IT IS☆171Updated 6 months ago
- ☆563Updated last month
- ☆115Updated last year
- Dolphin System Messages☆202Updated 2 months ago
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆162Updated 4 months ago
- Code execution utilities for Open WebUI & Ollama☆197Updated last week
- ☆227Updated 3 weeks ago
- AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models☆139Updated 6 months ago
- A simple FastAPI Server to run XTTSv2☆411Updated 3 months ago
- This is our own implementation of 'Layer Selective Rank Reduction'☆232Updated 5 months ago