bjj / exllamav2-openai-serverLinks

An OpenAI API compatible LLM inference server based on ExLlamaV2.

☆25

Alternatives and similar repositories for exllamav2-openai-server

Users that are interested in exllamav2-openai-server are comparing it to the libraries listed below

Sorting:

Hellisotherpeople / llm_steer-oobabooga
Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…
☆43Updated last year
thooton / muse
Let's create synthetic textbooks together :)
☆75Updated last year
thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆179Updated last year
the-crypt-keeper / the-muse
Experimental sampler to make LLMs more creative
☆31Updated 2 years ago
CoffeeVampir3 / ez-trainer
Train Llama Loras Easily
☆31Updated 2 years ago
Gryphe / MergeMonster
An unsupervised model merging algorithm for Transformers-based language models.
☆108Updated last year
zarakiquemparte / zaraki-tools
☆27Updated 2 years ago
desik1998 / MathWithLLMs
☆50Updated last year
Digitous / ModelREVOLVER
Model REVOLVER, a human in the loop model mixing system.
☆33Updated 2 years ago
teknium1 / ShareGPT-Builder
☆117Updated 11 months ago
EduardTalianu / EntropixLab
entropix style sampling + GUI
☆27Updated last year
reka-ai / rekaquant
☆62Updated 4 months ago
VatsaDev / NanoPhi-alpha
GPT-2 small trained on phi-like data
☆67Updated last year
nicholasyager / llama-cpp-guidance
A guidance compatibility layer for llama-cpp-python
☆36Updated 2 years ago
emrgnt-cmplxty / zero-shot-replication
☆74Updated 2 years ago
l4b4r4b4b4 / AIDocks
LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT
☆27Updated last year
huggingface / discord-bots
☆51Updated 2 years ago
monk1337 / auto-ollama
run ollama & gguf easily with a single command
☆52Updated last year
serp-ai / unsloth
5X faster 60% less memory QLoRA finetuning
☆21Updated last year
nyunAI / PruneGPT
☆51Updated last year
attashe / ModifiedBeamSampler
Modified Beam Search with periodical restart
☆12Updated last year
emrgnt-cmplxty / SmolTrainer
☆21Updated 2 years ago
ahmed-moubtahij / TokenHealer
☆23Updated last year
the-crypt-keeper / LLooM
Experimental LLM Inference UX to aid in creative writing
☆127Updated 11 months ago
tolitius / towel
"a towel is about the most massively useful thing an interstellar AI hitchhiker can have"
☆48Updated last year
QuixiAI / kraken
☆68Updated last year
shivamsanju / ragswift
🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform
☆38Updated last year
huseinzol05 / transformers-continuous-batching
Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.
☆29Updated 8 months ago
adrienbrault / ollama-nous-hermes2pro
Ollama models of NousResearch/Hermes-2-Pro-Mistral-7B-GGUF
☆32Updated last year
OpenAccess-AI-Collective / ggml-webui
Deploy your GGML models to HuggingFace Spaces with Docker and gradio
☆38Updated 2 years ago