AlpinDale / gptq-gptjLinks
Code for the paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers" with GPT-J implementation.
☆15Updated 2 years ago
Alternatives and similar repositories for gptq-gptj
Users that are interested in gptq-gptj are comparing it to the libraries listed below
Sorting:
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Updated last year
- Public reports detailing responses to sets of prompts by Large Language Models.☆30Updated 5 months ago
- 4 bits quantization of SantaCoder using GPTQ☆51Updated 2 years ago
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations☆33Updated 2 years ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- ☆73Updated last year
- Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Updated last year
- 🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.☆56Updated 3 years ago
- Merge LLM that are split in to parts☆26Updated last year
- SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia☆41Updated 2 years ago
- Self-hosted LLM chatbot arena, with yourself as the only judge☆41Updated last year
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆69Updated last year
- ☆40Updated 2 years ago
- A library for squeakily cleaning and filtering language datasets.☆47Updated last year
- Trying to deconstruct RWKV in understandable terms☆14Updated 2 years ago
- HuggingChat like UI in Gradio☆71Updated 2 years ago
- Experimental sampler to make LLMs more creative☆31Updated last year
- [WIP] Transformer to embed Danbooru labelsets☆13Updated last year
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆25Updated 2 years ago
- ☆12Updated 2 years ago
- Patch for MPT-7B which allows using and training a LoRA☆58Updated 2 years ago
- Web page with political compass quiz results for open LLMs☆37Updated last year
- ☆31Updated last year
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…☆21Updated 2 years ago
- ☆13Updated 2 years ago
- ☆27Updated last year
- Instruct-tune LLaMA on consumer hardware☆74Updated 2 years ago
- Tools for formatting large language model prompts.☆13Updated last year
- ☆74Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year