0cc4m / GPTQ-for-LLaMaLinks
4 bits quantization of LLMs using GPTQ
☆49Updated last year
Alternatives and similar repositories for GPTQ-for-LLaMa
Users that are interested in GPTQ-for-LLaMa are comparing it to the libraries listed below
Sorting:
- An unsupervised model merging algorithm for Transformers-based language models.☆104Updated last year
- A KoboldAI-like memory extension for oobabooga's text-generation-webui☆109Updated 7 months ago
- Text WebUI extension to add clever Notebooks to Chat mode☆139Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆122Updated last year
- ☆158Updated last year
- Oobabooga extension for Bark TTS☆117Updated last year
- Where we keep our notes about model training runs.☆16Updated 2 years ago
- Model REVOLVER, a human in the loop model mixing system.☆32Updated last year
- A prompt/context management system☆170Updated 2 years ago
- 4 bits quantization of LLaMa using GPTQ☆129Updated 2 years ago
- Dynamic parameter modulation for oobabooga's text-generation-webui that adjusts generation parameters to better mirror user affect.☆35Updated last year
- ☆27Updated last year
- A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.☆309Updated last year
- Image Diffusion block merging technique applied to transformers based Language Models.☆54Updated 2 years ago
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆109Updated last year
- Framework agnostic python runtime for RWKV models☆146Updated last year
- Traing PRO extension for oobabooga WebUI - recent dev version☆49Updated this week
- An extension for oobabooga's text-generation-webui that adds syntax highlighting to code snippets☆67Updated last year
- mikugg is a Frontend for "Generative Visual Novels"☆146Updated this week
- Extension for Text Generation Webui based on EdgeGPT, a reverse engineered API of Microsoft's Bing Chat AI☆126Updated last year
- An autonomous AI agent extension for Oobabooga's web ui☆174Updated last year
- C/C++ implementation of PygmalionAI/pygmalion-6b☆55Updated 2 years ago
- Train Llama Loras Easily☆30Updated last year
- Diffusion_TTS extension for booga☆68Updated 11 months ago
- Merge Transformers language models by use of gradient parameters.☆207Updated 9 months ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆63Updated last year
- An Extension for oobabooga/text-generation-webui☆36Updated last year
- Our data munging code.☆34Updated 8 months ago
- CHAracter State Management - a generative text adventure (engine)☆65Updated 7 months ago
- Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.☆70Updated 2 years ago