theroyallab / tabbyAPI
An OAI compatible exllamav2 API that's both lightweight and fast
☆455 Updated this week
Related projects:
- Web UI for ExLlamaV2 ☆420 Updated 2 weeks ago
- Large-scale LLM inference engine ☆934 Updated this week
- LLM Frontend in a single html file ☆217 Updated last week
- A multimodal, function calling powered LLM webui. ☆204 Updated 3 months ago
- A fast batching API to serve LLM models ☆172 Updated 4 months ago
- An extension for oobabooga/text-generation-webui that enables the LLM to search the web using DuckDuckGo ☆159 Updated this week
- Memoir+ a persona extension for Text Gen Web UI. That includes memory, emotions, command handling and more. ☆164 Updated last month
- Convert Compute And Books Into Instruct-Tuning Datasets (or classifiers)! ☆816 Updated this week
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM … ☆467 Updated last month
- function calling-based LLM agents ☆268 Updated this week
- Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens ☆287 Updated 3 months ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2. ☆113 Updated 4 months ago
- An AI assistant beyond the chat box. ☆314 Updated 6 months ago
- TheBloke's Dockerfiles ☆296 Updated 6 months ago
- A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion. ☆305 Updated last year
- ☆112 Updated last year
- ☆543 Updated 2 weeks ago
- A simple FastAPI Server to run XTTSv2 ☆357 Updated last month
- Customizable implementation of the self-instruct paper. ☆1,004 Updated 6 months ago
- Text WebUI extension to add clever Notebooks to Chat mode ☆130 Updated 8 months ago
- A prompt/context management system ☆163 Updated last year
- Falcon LLM ggml framework with CPU and GPU support ☆245 Updated 7 months ago
- This is our own implementation of 'Layer Selective Rank Reduction' ☆230 Updated 3 months ago
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM. ☆222 Updated this week
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible ☆229 Updated last month
- Low-Rank adapter extraction for fine-tuned transformers model ☆154 Updated 4 months ago
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data. ☆223 Updated 2 weeks ago
- An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend. ☆308 Updated 3 weeks ago
- Convenience scripts to finetune (chat-)LLaMa3 and other models for any language ☆260 Updated 3 months ago
- Open source LLM UI, compatible with all local LLM providers. ☆163 Updated last week