AlpinDale / sparsegpt-for-LLaMAView external linksLinks
Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.
☆71Mar 30, 2023Updated 2 years ago
Alternatives and similar repositories for sparsegpt-for-LLaMA
Users that are interested in sparsegpt-for-LLaMA are comparing it to the libraries listed below
Sorting:
- ☆40Mar 25, 2023Updated 2 years ago
- A Python library for fetching data from 4chan in a programmatically friendly way.☆13May 26, 2025Updated 8 months ago
- Falcon7B + Falcon40B support - in branch falcon40b. Now all good and working. But main action now in https://github.com/cmp-nct/ggllm.cpp☆10Sep 30, 2023Updated 2 years ago
- SDXL GPU cluster scripts☆16Oct 28, 2023Updated 2 years ago
- [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization☆713Aug 13, 2024Updated last year
- Conversion script adapting vicuna dataset into alpaca format for use with oobabooga's trainer☆13Jun 21, 2023Updated 2 years ago
- A tool to easily benchmark Japanese translation skills☆13Oct 11, 2025Updated 4 months ago
- ✅ ChatGPT Plugin for performing basic arithmetic operations☆18May 15, 2023Updated 2 years ago
- ☆24May 23, 2025Updated 8 months ago
- A small standalone flask python server for llama.cpp that acts like a KoboldAI api.☆14May 20, 2023Updated 2 years ago
- Code for my ICLR 2024 TinyPapers paper "Prune and Tune: Improving Efficient Pruning Techniques for Massive Language Models"☆16May 26, 2023Updated 2 years ago
- ☆535Dec 1, 2023Updated 2 years ago
- LLaMa retrieval plugin script using OpenAI's retrieval plugin☆323Mar 27, 2023Updated 2 years ago
- ☆20Mar 28, 2023Updated 2 years ago
- Simple, Efficient, and Effective Negative Guidance in Few-Step Image Generation Models By Value Sign Flip☆37Jan 27, 2026Updated 2 weeks ago
- ☆92Apr 9, 2023Updated 2 years ago
- This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver…☆22Nov 18, 2024Updated last year
- Majesty Diffusion by @Dango233 and @apolinario (@multimodalart)☆25Jul 26, 2022Updated 3 years ago
- GPP CPassword Decryption Tools☆12Jun 13, 2022Updated 3 years ago
- anagora.org/node/agora-bot☆23Jan 28, 2026Updated 2 weeks ago
- This was orginally written by: https://github.com/hlky☆49Oct 12, 2023Updated 2 years ago
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 4 months ago
- ☆337Jul 28, 2025Updated 6 months ago
- LLM Powered discord bot, Character Card enabled Chat page, Stable Diffusion discord bot, and overall AI tool. All from one app, TalOS: Re…☆34Oct 20, 2024Updated last year
- fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backe…☆412Jun 2, 2023Updated 2 years ago
- A Simple Discord Bot for the Alpaca LLM☆99Jun 22, 2023Updated 2 years ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆70Aug 27, 2023Updated 2 years ago
- ☆50Jun 16, 2025Updated 7 months ago
- Modeling code for a BitNet b1.58 Llama-style model.☆25Apr 30, 2024Updated last year
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.☆31May 29, 2023Updated 2 years ago
- A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.☆310Aug 22, 2023Updated 2 years ago
- An interface for llama.cpp, ChatGPT, Gemini, and Claude☆27Jan 29, 2026Updated 2 weeks ago
- Official implementation of the transformer (TF) architecture suggested in a paper entitled "Looped Transformers as Programmable Computers…☆30Apr 8, 2023Updated 2 years ago
- [ACL 2024] A novel QAT with Self-Distillation framework to enhance ultra low-bit LLMs.☆134May 16, 2024Updated last year
- 트랜스포머 블록을 활용한 상품명 자연어처리 기반 카테고리 분류 모델☆10Dec 5, 2022Updated 3 years ago
- Just a bunch of benchmark logs for different LLMs☆119Jul 28, 2024Updated last year
- 4 bits quantization of LLaMA using GPTQ☆3,074Jul 13, 2024Updated last year
- Simple extension for text-generation-webui that injects recent conversation history into the negative prompt with the goal of minimizing …☆32Nov 20, 2023Updated 2 years ago
- VPTQ, A Flexible and Extreme low-bit quantization algorithm☆674Apr 25, 2025Updated 9 months ago