theroyallab / tabbyAPI

The official API server for Exllama. OAI compatible, lightweight, and fast.

☆1,016

Alternatives and similar repositories for tabbyAPI

Users who are interested in tabbyAPI are comparing it to the libraries listed below.

turboderp-org / exui
Web UI for ExLlamaV2
☆506 Updated 5 months ago
turboderp-org / exllamav3
An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs
☆455 Updated this week
aphrodite-engine / aphrodite-engine
Large-scale LLM inference engine
☆1,492 Updated this week
lmg-anon / mikupad
LLM Frontend in a single html file
☆533 Updated 6 months ago
mostlygeek / llama-swap
Model swapping for llama.cpp (or any local OpenAPI compatible server)
☆1,088 Updated this week
oobabooga / text-generation-webui-extensions
☆648 Updated last week
ikawrakow / ik_llama.cpp
llama.cpp fork with additional SOTA quants and improved performance
☆902 Updated last week
matt-c1 / llama-3-quant-comparison
Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.
☆158 Updated last year
theroyallab / YALS
☆81 Updated last week
QuixiAI / dolphin-system-messages
Dolphin System Messages
☆321 Updated 5 months ago
SomeOddCodeGuy / WilmerAI
What If Language Models Expertly Routed All Inference? WilmerAI allows prompts to be routed to specialized workflows based on the domain …
☆737 Updated this week
itsme2417 / PolyMind
A multimodal, function calling powered LLM webui.
☆215 Updated 10 months ago
Maximilian-Winter / llama-cpp-agent
The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …
☆579 Updated 5 months ago
v2rockets / Loyal-Elephie
Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible
☆319 Updated 5 months ago
turboderp-org / exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
☆4,245 Updated 2 weeks ago
matatonic / openedai-vision
An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.
☆259 Updated 4 months ago
YellowRoseCx / koboldcpp-rocm
AI Inferencing at the Edge. A simple one-file way to run various GGML models with KoboldAI's UI with AMD ROCm offloading
☆656 Updated 2 weeks ago
mamei16 / LLM_Web_search
An extension for oobabooga/text-generation-webui that enables the LLM to search the web using DuckDuckGo
☆251 Updated last week
brucepro / Memoir
Memoir+ a persona memory extension for Text Gen Web UI.
☆210 Updated 2 weeks ago
FailSpy / abliterator
Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens
☆492 Updated last year
bodaay / HuggingFaceModelDownloader
Simple go utility to download HuggingFace Models and Datasets
☆707 Updated 9 months ago
AndrewVeee / nucleo-ai
An AI assistant beyond the chat box.
☆328 Updated last year
ILikeAI / AlwaysReddy
AlwaysReddy is a LLM voice assistant that is always just a hotkey away.
☆747 Updated 4 months ago
epolewski / EricLLM
A fast batching API to serve LLM models
☆185 Updated last year
matatonic / openedai-speech
An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.
☆797 Updated 6 months ago
Woolverine94 / biniou
a self-hosted webui for 30+ generative ai
☆596 Updated this week
henk717 / KoboldAI
KoboldAI is generative AI software optimized for fictional use, but capable of much more!
☆414 Updated 6 months ago
antibitcoin / ReflectionAnyLLM
This project demonstrates a basic chain-of-thought interaction with any LLM (Large Language Model)
☆321 Updated 10 months ago
Atinoda / text-generation-webui-docker
Docker variants of oobabooga's text-generation-webui, including pre-built images.
☆433 Updated 3 weeks ago
LostRuins / lite.koboldai.net
A zero dependency web UI for any LLM backend, including KoboldCpp, OpenAI and AI Horde
☆128 Updated this week