jllllll / GPTQ-for-LLaMa-Wheels

Precompiled Wheels for GPTQ-for-LLaMa

☆18

Alternatives and similar repositories for GPTQ-for-LLaMa-Wheels

Users who are interested in GPTQ-for-LLaMa-Wheels are comparing it to the repositories listed below

theubie / complex_memory
A KoboldAI-like memory extension for oobabooga's text-generation-webui
☆108 Updated 7 months ago
Gryphe / MergeMonster
An unsupervised model merging algorithm for Transformers-based language models.
☆105 Updated last year
s4rduk4r / alpaca_lora_4bit_readme
Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit
☆31 Updated 2 years ago
zarakiquemparte / zaraki-tools
☆27 Updated last year
oobabooga / GPTQ-for-LLaMa
4 bits quantization of LLaMa using GPTQ
☆129 Updated 2 years ago
wsippel / bark_tts
Oobabooga extension for Bark TTS
☆119 Updated last year
acpopescu / bitsandbytes
8-bit CUDA functions for PyTorch
☆44 Updated 2 years ago
jllllll / exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
☆64 Updated last year
jllllll / GPTQ-for-LLaMa-CUDA
A combination of Oobabooga's fork and the main cuda branch of GPTQ-for-LLaMa in a package format.
☆22 Updated last year
FartyPants / Playground
Text WebUI extension to add clever Notebooks to Chat mode
☆140 Updated last week
FartyPants / the_muse
oobabooga extension - Experimental sampler to make LLMs more creative
☆23 Updated last year
wawawario2 / long_term_memory
A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.
☆309 Updated last year
Digitous / ModelREVOLVER
Model REVOLVER, a human in the loop model mixing system.
☆33 Updated last year
Ph0rk0z / text-generation-webui-testing
A fork of textgen that kept some things like Exllama and old GPTQ.
☆22 Updated 10 months ago
eugenepentland / landmark-attention-qlora
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆123 Updated 2 years ago
AlpinDale / RPTQ-for-LLaMA
Efficient 3bit/4bit quantization of LLaMA models
☆19 Updated 2 years ago
Maximilian-Winter / AIRoleplay
Little AI roleplay program
☆58 Updated last year
FartyPants / Training_PRO
Training PRO extension for oobabooga WebUI - recent dev version
☆50 Updated last week
Silver267 / pytorch-to-safetensor-converter
A simple converter which converts pytorch bin files to safetensor, intended to be used for LLM conversion.
☆69 Updated last year
CoffeeVampir3 / ez-trainer
Train Llama Loras Easily
☆31 Updated last year
neromous / RWKV-Ouroboros
This project is established for real-time training of the RWKV model.
☆49 Updated last year
0cc4m / GPTQ-for-LLaMa
4 bits quantization of LLMs using GPTQ
☆49 Updated last year
Pandaily591 / OnlySpeakTTS
Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s…
☆52 Updated last year
Keith-Hon / bitsandbytes-windows
8-bit CUDA functions for PyTorch in Windows 10
☆69 Updated last year
danikhan632 / guidance_api
An Extension for oobabooga/text-generation-webui
☆36 Updated last year
AlpinDale / pygmalion.cpp
C/C++ implementation of PygmalionAI/pygmalion-6b
☆56 Updated 2 years ago
kanttouchthis / text_generation_webui_xtts
XTTSv2 Extension for oobabooga text-generation-webui
☆154 Updated last year
VatsaDev / NanoPhi-alpha
GPT-2 small trained on phi-like data
☆66 Updated last year
GiusTex / EdgeGPT
Extension for Text Generation Webui based on EdgeGPT, a reverse engineered API of Microsoft's Bing Chat AI
☆125 Updated last year
mayank31398 / GPTQ-for-SantaCoder
4 bits quantization of SantaCoder using GPTQ
☆51 Updated 2 years ago