0cc4m / GPTQ-for-LLaMaLinks
4 bits quantization of LLMs using GPTQ
☆49Updated 2 years ago
Alternatives and similar repositories for GPTQ-for-LLaMa
Users that are interested in GPTQ-for-LLaMa are comparing it to the libraries listed below
Sorting:
- A KoboldAI-like memory extension for oobabooga's text-generation-webui☆108Updated last year
- Text WebUI extension to add clever Notebooks to Chat mode☆144Updated 3 months ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆124Updated 2 years ago
- A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.☆309Updated 2 years ago
- 4 bits quantization of LLaMa using GPTQ☆131Updated 2 years ago
- Oobabooga extension for Bark TTS☆119Updated 2 years ago
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆109Updated 2 years ago
- An unsupervised model merging algorithm for Transformers-based language models.☆108Updated last year
- A prompt/context management system☆171Updated 2 years ago
- ☆157Updated 2 years ago
- Model REVOLVER, a human in the loop model mixing system.☆33Updated 2 years ago
- XTTSv2 Extension for oobabooga text-generation-webui☆155Updated 2 years ago
- A combination of Oobabooga's fork and the main cuda branch of GPTQ-for-LLaMa in a package format.☆22Updated 2 years ago
- An Extension for oobabooga/text-generation-webui☆36Updated 2 years ago
- An autonomous AI agent extension for Oobabooga's web ui☆174Updated 2 years ago
- Framework agnostic python runtime for RWKV models☆147Updated 2 years ago
- An experimental open-source attempt to make GPT-4 fully autonomous.☆98Updated 2 years ago
- 4 bits quantization of SantaCoder using GPTQ☆51Updated 2 years ago
- Dynamic parameter modulation for oobabooga's text-generation-webui that adjusts generation parameters to better mirror user affect.☆35Updated 2 years ago
- C/C++ implementation of PygmalionAI/pygmalion-6b☆56Updated 2 years ago
- ChatGPT-like Web UI for RWKVstic☆100Updated 2 years ago
- Merge Transformers language models by use of gradient parameters.☆209Updated last year
- Extension for Text Generation Webui based on EdgeGPT, a reverse engineered API of Microsoft's Bing Chat AI☆124Updated 2 years ago
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit☆32Updated 2 years ago
- ☆27Updated 2 years ago
- ☆535Updated last year
- Prototype UI for chatting with the Pygmalion models.☆235Updated 2 years ago
- Instruct-tuning LLaMA on consumer hardware☆66Updated 2 years ago
- Diffusion_TTS extension for booga☆68Updated 2 months ago
- Train Llama Loras Easily☆31Updated 2 years ago