Efficient 3bit/4bit quantization of LLaMA models
☆18 · May 18, 2023 · Updated 2 years ago
Alternatives and similar repositories for RPTQ-for-LLaMA
Users that are interested in RPTQ-for-LLaMA are comparing it to the libraries listed below.
- LLM RP TUI for Power Users. ☆32 · Jan 13, 2026 · Updated 2 months ago
- Conversion script adapting vicuna dataset into alpaca format for use with oobabooga's trainer ☆13 · Jun 21, 2023 · Updated 2 years ago
- Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation. ☆71 · Mar 30, 2023 · Updated 3 years ago
- Code for the paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers" with GPT-J implementation. ☆15 · Mar 22, 2023 · Updated 3 years ago
- Prompt Jinja2 templates for LLMs ☆35 · Jul 9, 2025 · Updated 8 months ago
- Create soft prompts for fairseq 13B dense, GPT-J-6B and GPT-Neo-2.7B for free in a Google Colab TPU instance ☆28 · Mar 1, 2023 · Updated 3 years ago
- BigKnow2022: Bringing Language Models Up to Speed ☆16 · Mar 27, 2023 · Updated 3 years ago
- SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia ☆42 · Mar 13, 2023 · Updated 3 years ago
- The official service back-end. ☆13 · Apr 8, 2023 · Updated 2 years ago
- C/C++ implementation of PygmalionAI/pygmalion-6b ☆55 · Apr 18, 2023 · Updated 2 years ago
- gui for Merge-Stable-Diffusion-models-without-distortion-gui ☆36 · Dec 31, 2022 · Updated 3 years ago
- The Pygmalion Docs ☆19 · Sep 16, 2023 · Updated 2 years ago
- SimplePIM is the first high-level programming framework for real-world processing-in-memory (PIM) architectures. Described in the PACT 20… ☆31 · Oct 23, 2023 · Updated 2 years ago
- An Android Application for GLCC ☆11 · Sep 30, 2022 · Updated 3 years ago
- CMake configurations for PPL projects ☆12 · Aug 10, 2024 · Updated last year
- A set of 7 classic D&D dice for all your dice rolling needs. Dice rolls are just for show and are not visible in AI prompts. ☆24 · Aug 1, 2025 · Updated 7 months ago
- An unsupervised model merging algorithm for Transformers-based language models. ☆108 · Apr 29, 2024 · Updated last year
- ☆94 · Dec 9, 2025 · Updated 3 months ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆125 · Jun 16, 2023 · Updated 2 years ago
- Mixing models of stable diffusion without weights loss ☆68 · Apr 19, 2023 · Updated 2 years ago
- BFloat16 Fused Adam Operator for PyTorch ☆17 · Nov 16, 2024 · Updated last year
- Xenoblade 3 research ☆14 · Dec 9, 2025 · Updated 3 months ago
- A simple Gradio WebUI for loading/unloading models and loras in tabbyAPI. ☆20 · Nov 21, 2024 · Updated last year
- Writings and Games by David Schirduan ☆16 · Dec 31, 2025 · Updated 2 months ago
- ☆40 · Mar 25, 2023 · Updated 3 years ago
- Small repository for my video on LoRA ☆16 · May 14, 2023 · Updated 2 years ago
- A set of ComfyUI nodes to quickly test generated QR codes for scannability. A companion project to ComfyQR. ☆12 · Jan 26, 2025 · Updated last year
- Self-contained Python lib with zero-dependencies that give you a unified device properties for gpu, cpu, and npu. No more calling separat… ☆14 · Dec 12, 2025 · Updated 3 months ago
- Unofficial implementation of Semantic-aware Guidance (S-CFG) for ComfyUI ☆12 · Aug 8, 2024 · Updated last year
- A fork of textgen that kept some things like Exllama and old GPTQ. ☆22 · Aug 20, 2024 · Updated last year
- Simple local all-in-one install for IDEA2.ART ☆26 · Jan 8, 2023 · Updated 3 years ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you… ☆37 · May 14, 2024 · Updated last year
- Turns KoboldAI into a crowdsourced distributed cluster ☆33 · Oct 19, 2023 · Updated 2 years ago
- View shadertoy shaders on your keyboard, save them and use them as your keyboard background animation! ☆10 · Dec 14, 2016 · Updated 9 years ago
- SDXL GPU cluster scripts ☆16 · Oct 28, 2023 · Updated 2 years ago
- ☆33 · Apr 23, 2023 · Updated 2 years ago
- A C++ fork/rewrite of the smhasher project to bring Murmurhash v.3 to the Linux shell and to the PHP scripting language. ☆21 · Jul 25, 2011 · Updated 14 years ago
- Windows low level keyboard hooking component ☆14 · Feb 5, 2017 · Updated 9 years ago
- Compare Savant and PyTorch performance ☆13 · Feb 9, 2024 · Updated 2 years ago