AlpinDale / gptq-gptj
Code for the paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers" with GPT-J implementation.
☆15Updated 2 years ago
Alternatives and similar repositories for gptq-gptj:
Users that are interested in gptq-gptj are comparing it to the libraries listed below
- ☆40Updated 2 years ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆30Updated 3 months ago
- SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia☆41Updated 2 years ago
- Trying to deconstruct RWKV in understandable terms☆14Updated last year
- ☆27Updated last year
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Updated last year
- 4 bits quantization of SantaCoder using GPTQ☆51Updated last year
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations☆33Updated last year
- A super simple web interface to perform blind tests on LLM outputs.☆28Updated last year
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆11Updated last year
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆25Updated 2 years ago
- [WIP] Transformer to embed Danbooru labelsets☆13Updated last year
- Experimental sampler to make LLMs more creative☆31Updated last year
- ☆13Updated last year
- The "GPT-API-Accelerate" project provides a set of Python classes for accelerating the process of generating responses to prompts using t…☆23Updated 6 months ago
- GGML implementation of BERT model with Python bindings and quantization.☆56Updated last year
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- Training a reward model for RLHF using RWKV.☆14Updated last year
- Data preparation code for CrystalCoder 7B LLM☆44Updated 11 months ago
- ☆16Updated last year
- BlinkDL's RWKV-v4 running in the browser☆47Updated 2 years ago
- Self-hosted LLM chatbot arena, with yourself as the only judge☆39Updated last year
- Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Updated last year
- Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.☆71Updated 2 years ago
- Tools for formatting large language model prompts.☆13Updated last year
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…☆20Updated 2 years ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆69Updated last year
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆22Updated last month
- ☆73Updated last year