AlpinDale/RPTQ-for-LLaMA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AlpinDale/RPTQ-for-LLaMA)

AlpinDale / RPTQ-for-LLaMA

Efficient 3bit/4bit quantization of LLaMA models

☆18

Alternatives and similar repositories for RPTQ-for-LLaMA

Users that are interested in RPTQ-for-LLaMA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

practical-dreamer / vicuna_to_alpacan
View on GitHub
Conversion script adapting vicuna dataset into alpaca format for use with oobabooga's trainer
☆12Jun 21, 2023Updated 3 years ago
AlpinDale / sillytui
View on GitHub
LLM RP TUI for Power Users.
☆35Jan 13, 2026Updated 6 months ago
VE-FORBRYDERNE / mesh-transformer-jax
View on GitHub
Fork of kingoflolz/mesh-transformer-jax with memory usage optimizations and support for GPT-Neo, GPT-NeoX, BLOOM, OPT and fairseq dense L…
☆22Nov 14, 2022Updated 3 years ago
AlpinDale / sparsegpt-for-LLaMA
View on GitHub
Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.
☆71Mar 30, 2023Updated 3 years ago
theroyallab / llm-prompt-templates
View on GitHub
Prompt Jinja2 templates for LLMs
☆36Jul 9, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
dphnAI / vessel
View on GitHub
Compile docker images into a single self-contained binary
☆20Apr 30, 2026Updated 2 months ago
VE-FORBRYDERNE / mtj-softtuner
View on GitHub
Create soft prompts for fairseq 13B dense, GPT-J-6B and GPT-Neo-2.7B for free in a Google Colab TPU instance
☆28Mar 1, 2023Updated 3 years ago
NolanoOrg / sparse_quant_llms
View on GitHub
SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia
☆42Mar 13, 2023Updated 3 years ago
RyokoAI / BigKnow2022
View on GitHub
BigKnow2022: Bringing Language Models Up to Speed
☆16Mar 27, 2023Updated 3 years ago
diStyApps / Merge-Stable-Diffusion-models-without-distortion-gui
View on GitHub
gui for Merge-Stable-Diffusion-models-without-distortion-gui
☆36Dec 31, 2022Updated 3 years ago
PygmalionAI / pygmalion-docs
View on GitHub
The Pygmalion Docs
☆19Sep 16, 2023Updated 2 years ago
DDGRCF / GLCC_AndroidApplication
View on GitHub
An Android Application for GLCC
☆11Sep 30, 2022Updated 3 years ago
jllllll / GPTQ-for-LLaMa-Wheels
View on GitHub
Precompiled Wheels for GPTQ-for-LLaMa
☆19Jul 26, 2023Updated 3 years ago
OpenPPL / hpcc
View on GitHub
CMake configurations for PPL projects
☆12Aug 10, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Gryphe / MergeMonster
View on GitHub
An unsupervised model merging algorithm for Transformers-based language models.
☆107Apr 29, 2024Updated 2 years ago
eugenepentland / landmark-attention-qlora
View on GitHub
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆123Jun 16, 2023Updated 3 years ago
Aemon-Algiz / LoRAExamples
View on GitHub
Small repository for my video on LoRA
☆16May 14, 2023Updated 3 years ago
winston779 / m78star
View on GitHub
M78星云机场官网地址
☆13Nov 20, 2025Updated 8 months ago
Ph0rk0z / text-generation-webui-testing
View on GitHub
A fork of textgen that kept some things like Exllama and old GPTQ.
☆22Aug 20, 2024Updated last year
shiimizu / ComfyUI-semantic-aware-guidance
View on GitHub
Unofficial implementation of Semantic-aware Guidance (S-CFG) for ComfyUI
☆13Aug 8, 2024Updated last year
eziapp-org / core
View on GitHub
EziApp：更快、更轻量！使用Web技术构建原生能力客户端
☆16May 9, 2026Updated 2 months ago
db0 / KoboldAI-Horde-Bridge
View on GitHub
Turns KoboldAI into a crowdsourced distributed cluster
☆34Oct 19, 2023Updated 2 years ago
ConiferLabsWA / flan-ul2-alpaca
View on GitHub
☆33Apr 23, 2023Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
Azeirah / K70-RGB-shader-animations
View on GitHub
View shadertoy shaders on your keyboard, save them and use them as your keyboard background animation!
☆10Dec 14, 2016Updated 9 years ago
johnsmith0031 / alpaca_lora_4bit
View on GitHub
☆534Dec 1, 2023Updated 2 years ago
ebolam / KoboldAI
View on GitHub
☆33May 23, 2025Updated last year
lix19937 / llm-deploy
View on GitHub
AI Infra LLM infer/ tensorrt-llm/ vllm
☆24Updated this week
talos-bots / TalOS-Reborn
View on GitHub
LLM Powered discord bot, Character Card enabled Chat page, Stable Diffusion discord bot, and overall AI tool. All from one app, TalOS: Re…
☆33Oct 20, 2024Updated last year
Venkat2811 / myelon
View on GitHub
Ultra-low-latency, high-throughput multiprocess transport over SHM and mmap. LMAX-Disruptor-style cross-process ring substrate.
☆17Updated this week
LeiWang1999 / AutoGPTQ.tvm
View on GitHub
GPTQ inference TVM kernel
☆41Apr 25, 2024Updated 2 years ago
PINTO0309 / MobileNetv2-SSDLite
View on GitHub
My proprietary procedure. Caffe implementation of SSD and SSDLite detection on MobileNetv2, converted from tensorflow.
☆23Mar 20, 2019Updated 7 years ago
EdVince / Android_learning
View on GitHub
☆23Aug 7, 2021Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
7bvcxz / PIMsim
View on GitHub
☆20Jun 1, 2023Updated 3 years ago
coreyryanhanson / ComfyQR-scanning-nodes
View on GitHub
A set of ComfyUI nodes to quickly test generated QR codes for scannability. A companion project to ComfyQR.
☆15Jan 26, 2025Updated last year
WAUthethird / diffusers-uncensored
View on GitHub
Uncensored fork of diffusers
☆39Feb 13, 2023Updated 3 years ago
adigunturu / AugmentedPhysics
View on GitHub
Creating Interactive and Embedded Physics Simulations from Static Textbook Diagrams
☆31Mar 18, 2025Updated last year
CLUEbenchmark / SuperCLUE-Fin
View on GitHub
中文金融大模型测评基准，六大类二十五任务、等级化评价，国内模型获得A级
☆10May 6, 2024Updated 2 years ago
atxcowboy / megasearch
View on GitHub
A plugin for Oobabooga TextUI that allows you to search multiple search engines. Initially we're using Google API or DuckDuckGo.
☆18Jun 4, 2023Updated 3 years ago
mschulkind / cordova-true-native-android
View on GitHub
Plugin for Cordova that enables fully native-looking apps
☆18Feb 26, 2015Updated 11 years ago