AlpinDale/sparsegpt-for-LLaMA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AlpinDale/sparsegpt-for-LLaMA)

AlpinDale / sparsegpt-for-LLaMA

Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.

☆71

Alternatives and similar repositories for sparsegpt-for-LLaMA

Users that are interested in sparsegpt-for-LLaMA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

paniphons / open-textbot-datasets
View on GitHub
Collection of various text datasets to assist ML researchers in training or fine-tuning their models
☆21Apr 1, 2023Updated 3 years ago
AlpinDale / RPTQ-for-LLaMA
View on GitHub
Efficient 3bit/4bit quantization of LLaMA models
☆18May 18, 2023Updated 3 years ago
lachlansneff / sparsellama
View on GitHub
☆40Mar 25, 2023Updated 3 years ago
Aspartame-e951 / apiserver.py
View on GitHub
A small standalone flask python server for llama.cpp that acts like a KoboldAI api.
☆14May 20, 2023Updated 3 years ago
PASSIONLab / distributed_sddmm
View on GitHub
Distributed SDDMM Kernel
☆12Jul 8, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
db0 / KoboldAI-Horde-Bridge
View on GitHub
Turns KoboldAI into a crowdsourced distributed cluster
☆34Oct 19, 2023Updated 2 years ago
dphnAI / vessel
View on GitHub
Compile docker images into a single self-contained binary
☆20Apr 30, 2026Updated 2 months ago
VE-FORBRYDERNE / mesh-transformer-jax
View on GitHub
Fork of kingoflolz/mesh-transformer-jax with memory usage optimizations and support for GPT-Neo, GPT-NeoX, BLOOM, OPT and fairseq dense L…
☆22Nov 14, 2022Updated 3 years ago
johnsmith0031 / alpaca_lora_4bit
View on GitHub
☆534Dec 1, 2023Updated 2 years ago
SqueezeAILab / SqueezeLLM
View on GitHub
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
☆722Aug 13, 2024Updated last year
lastmile-ai / llama-retrieval-plugin
View on GitHub
LLaMa retrieval plugin script using OpenAI's retrieval plugin
☆321Mar 27, 2023Updated 3 years ago
Qwen-Applications / DIR
View on GitHub
☆17Feb 14, 2026Updated 5 months ago
IST-DASLab / sparsegpt
View on GitHub
Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".
☆891Aug 20, 2024Updated last year
vvvm23 / mezo-jax
View on GitHub
JAX implementation of "Fine-Tuning Language Models with Just Forward Passes"
☆19Jun 10, 2023Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
TehVenomm / LM_Transformers_BlockMerge
View on GitHub
Image Diffusion block merging technique applied to transformers based Language Models.
☆55May 8, 2023Updated 3 years ago
official-elinas / zeus-llm-trainer
View on GitHub
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆69Aug 27, 2023Updated 2 years ago
teknium1 / RawTransform
View on GitHub
A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.
☆34May 29, 2023Updated 3 years ago
PotatoSpudowski / fastLLaMa
View on GitHub
fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backe…
☆413Jun 2, 2023Updated 3 years ago
chu-tianxiang / QuIP-for-all
View on GitHub
QuIP quantization
☆66Mar 17, 2024Updated 2 years ago
pointnetwork / point-alpaca
View on GitHub
☆402Mar 22, 2023Updated 3 years ago
dynamiccreator / lora_scripts
View on GitHub
This repo helps to transform text into a better form for lora training
☆12Apr 9, 2023Updated 3 years ago
NolanoOrg / sparse_quant_llms
View on GitHub
SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia
☆42Mar 13, 2023Updated 3 years ago
Gryphe / MergeMonster
View on GitHub
An unsupervised model merging algorithm for Transformers-based language models.
☆107Apr 29, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
swtheing / WizardCoder_Instruct_Generator
View on GitHub
Generate the WizardCoder Instruct from the CodeAlpaca
☆21Jun 27, 2023Updated 3 years ago
talos-bots / TalOS-Reborn
View on GitHub
LLM Powered discord bot, Character Card enabled Chat page, Stable Diffusion discord bot, and overall AI tool. All from one app, TalOS: Re…
☆33Oct 20, 2024Updated last year
mirage-studio-io / chat-calculator-plugin
View on GitHub
✅ ChatGPT Plugin for performing basic arithmetic operations
☆18May 15, 2023Updated 3 years ago
imoneoi / bf16_fused_adam
View on GitHub
BFloat16 Fused Adam Operator for PyTorch
☆20Nov 16, 2024Updated last year
wawawario2 / long_term_memory
View on GitHub
A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.
☆305Aug 22, 2023Updated 2 years ago
ClayShoaf / oobabooga-one-click-bandaid
View on GitHub
A simple batch file to make the oobabooga one click installer compatible with llama 4bit models and able to run on cuda
☆21Mar 27, 2023Updated 3 years ago
Ph0rk0z / text-generation-webui-testing
View on GitHub
A fork of textgen that kept some things like Exllama and old GPTQ.
☆22Aug 20, 2024Updated last year
AlpinDale / sillytui
View on GitHub
LLM RP TUI for Power Users.
☆35Jan 13, 2026Updated 6 months ago
ogkalu2 / Merge-Stable-Diffusion-models-without-distortion
View on GitHub
Adaptation of the merging method described in the paper - Git Re-Basin: Merging Models modulo Permutation Symmetries (https://arxiv.org/a…
☆148Apr 30, 2024Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
shawnricecake / search-llm
View on GitHub
[NeurIPS 2024] Search for Efficient LLMs
☆16Jan 16, 2025Updated last year
AiMiDi / stable-diffusion-webui-advance-prompt-tuning
View on GitHub
Extension for stable diffusion webui to add advance prompt tuning
☆10Nov 13, 2022Updated 3 years ago
ModelTC / QLLM
View on GitHub
[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…
☆39Mar 11, 2024Updated 2 years ago
euclaise / SlimTrainer
View on GitHub
Full finetuning of large language models without large memory requirements
☆92Sep 22, 2025Updated 10 months ago
ArthurRichard / acanalysis
View on GitHub
A tool to explore/extract Ace Combat 4/5/0's files.
☆13Apr 23, 2022Updated 4 years ago
princeton-nlp / LLM-Shearing
View on GitHub
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
☆643Mar 4, 2024Updated 2 years ago
teknium1 / stanford_alpaca-replit
View on GitHub
Modified Stanford-Alpaca Trainer for Training Replit's Code Model
☆47Jun 1, 2023Updated 3 years ago