AlpinDale / gptq-gptj
Code for the paper "GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers", with a GPT-J implementation.
☆15 Updated 2 years ago
Alternatives and similar repositories for gptq-gptj
Users who are interested in gptq-gptj are comparing it to the repositories listed below.
- Public reports detailing responses to sets of prompts by Large Language Models. ☆30 Updated 4 months ago
- Command-line script for running inference with models such as LLaMA in a chat scenario, with LoRA adaptations ☆33 Updated last year
- Experimental sampler to make LLMs more creative ☆31 Updated last year
- ☆12 Updated 7 months ago
- ☆75 Updated last year
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping. ☆25 Updated 2 years ago
- QLoRA with Enhanced Multi GPU Support ☆37 Updated last year
- ☆27 Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models ☆45 Updated last year
- SparseGPT + GPTQ Compression of LLMs like LLaMA, OPT, Pythia ☆41 Updated 2 years ago
- Self-hosted LLM chatbot arena, with yourself as the only judge ☆40 Updated last year
- Instruct-tune LLaMA on consumer hardware ☆74 Updated last year
- An OpenAI API compatible LLM inference server based on ExLlamaV2. ☆25 Updated last year
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models ☆69 Updated last year
- ☆73 Updated last year
- Jupyter notebooks for cloud-based usage ☆10 Updated last year
- A super simple web interface to perform blind tests on LLM outputs. ☆28 Updated last year
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this… ☆20 Updated 2 years ago
- ☆26 Updated last year
- LLM finetuning ☆42 Updated last year
- ☆25 Updated 3 months ago
- A minimalist Docker project to help people get started with Node, WizardCoder, CTransformers, Python, Express and TypeScript. Ready t… ☆14 Updated last year
- ☆40 Updated 2 years ago
- ☆53 Updated 11 months ago
- Train LLaMA with LoRA on a single 4090 and merge the LoRA weights to work like Stanford Alpaca. ☆51 Updated last year
- [WIP] AI Try-On plugin for Chrome ☆27 Updated last year
- Model REVOLVER, a human-in-the-loop model mixing system. ☆33 Updated last year
- Nexusflow function call, tool use, and agent benchmarks. ☆19 Updated 5 months ago
- ☆37 Updated 2 years ago
- Spherical merge of PyTorch/HF format language models with minimal feature loss. ☆121 Updated last year