sterlind / GPTQ-for-LLaMa
4 bits quantization of LLaMa using GPTQ
☆12Updated last year
Alternatives and similar repositories for GPTQ-for-LLaMa:
Users that are interested in GPTQ-for-LLaMa are comparing it to the libraries listed below
- Conversion script adapting vicuna dataset into alpaca format for use with oobabooga's trainer☆12Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆123Updated last year
- Model REVOLVER, a human in the loop model mixing system.☆33Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆77Updated 11 months ago
- Text WebUI extension to add clever Notebooks to Chat mode☆139Updated last year
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆111Updated last year
- ☆73Updated last year
- ☆27Updated last year
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub☆158Updated last year
- Patch for MPT-7B which allows using and training a LoRA☆58Updated last year
- Simple and fast server for GPTQ-quantized LLaMA inference☆24Updated last year
- GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQ☆99Updated last year
- A prompt/context management system☆169Updated last year
- ☆94Updated last year
- An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.☆37Updated last year
- Memoria is a human-inspired memory architecture for neural networks.☆62Updated 5 months ago
- GPT-2 small trained on phi-like data☆65Updated last year
- ☆168Updated last year
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆117Updated last year
- Merge Transformers language models by use of gradient parameters.☆205Updated 7 months ago
- 4 bits quantization of SantaCoder using GPTQ☆51Updated last year
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆147Updated last year
- manage histories of LLM applied applications☆88Updated last year
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆43Updated last year
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆69Updated last year
- Porting BabyAGI to Oobabooba.☆33Updated last year
- A guidance language for controlling large language models.☆45Updated last year
- This is our own implementation of 'Layer Selective Rank Reduction'☆233Updated 10 months ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆102Updated 8 months ago
- Load local LLMs effortlessly in a Jupyter notebook for testing purposes alongside Langchain or other agents. Contains Oobagooga and Kobol…☆211Updated last year