Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.
☆71Mar 30, 2023Updated 2 years ago
Alternatives and similar repositories for sparsegpt-for-LLaMA
Users that are interested in sparsegpt-for-LLaMA are comparing it to the libraries listed below
Sorting:
- ☆40Mar 25, 2023Updated 2 years ago
- Structural Pruning for LLaMA☆54May 20, 2023Updated 2 years ago
- An AI tool designed to generate explanations for every file in a project☆14Mar 7, 2025Updated last year
- A Python library for fetching data from 4chan in a programmatically friendly way.☆13May 26, 2025Updated 9 months ago
- Falcon7B + Falcon40B support - in branch falcon40b. Now all good and working. But main action now in https://github.com/cmp-nct/ggllm.cpp☆10Sep 30, 2023Updated 2 years ago
- Thin wrapper around GGML to make life easier☆42Nov 5, 2025Updated 4 months ago
- [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization☆712Aug 13, 2024Updated last year
- A tool to easily benchmark Japanese translation skills☆13Oct 11, 2025Updated 4 months ago
- A small standalone flask python server for llama.cpp that acts like a KoboldAI api.☆14May 20, 2023Updated 2 years ago
- ☆25May 23, 2025Updated 9 months ago
- Turns KoboldAI into a crowdsourced distributed cluster☆33Oct 19, 2023Updated 2 years ago
- Code for my ICLR 2024 TinyPapers paper "Prune and Tune: Improving Efficient Pruning Techniques for Massive Language Models"☆16May 26, 2023Updated 2 years ago
- ☆535Dec 1, 2023Updated 2 years ago
- ☆404Mar 22, 2023Updated 2 years ago
- ☆20Mar 28, 2023Updated 2 years ago
- ☆92Apr 9, 2023Updated 2 years ago
- GPP CPassword Decryption Tools☆12Jun 13, 2022Updated 3 years ago
- This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver…☆22Nov 18, 2024Updated last year
- Majesty Diffusion by @Dango233 and @apolinario (@multimodalart)☆25Jul 26, 2022Updated 3 years ago
- anagora.org/node/agora-bot☆23Feb 14, 2026Updated 2 weeks ago
- ☆95Jun 4, 2024Updated last year
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 5 months ago
- Trigger any command palette command via an obsidian:// uri☆27Jun 30, 2021Updated 4 years ago
- ☆337Updated this week
- LLM Powered discord bot, Character Card enabled Chat page, Stable Diffusion discord bot, and overall AI tool. All from one app, TalOS: Re…☆34Oct 20, 2024Updated last year
- Everything about me goes here!☆23Jun 18, 2023Updated 2 years ago
- fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backe…☆412Jun 2, 2023Updated 2 years ago
- [ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning☆642Mar 4, 2024Updated 2 years ago
- A Simple Discord Bot for the Alpaca LLM☆97Jun 22, 2023Updated 2 years ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆70Aug 27, 2023Updated 2 years ago
- C/C++ implementation of PygmalionAI/pygmalion-6b☆55Apr 18, 2023Updated 2 years ago
- Pytorch code for paper QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models☆25Sep 27, 2023Updated 2 years ago
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations☆33Jun 1, 2023Updated 2 years ago
- Download images and convert it to pdf (NSFW: A+)☆14Mar 29, 2025Updated 11 months ago
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.☆31May 29, 2023Updated 2 years ago
- ☆50Jun 16, 2025Updated 8 months ago
- A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.☆310Aug 22, 2023Updated 2 years ago
- Train Llama Loras Easily☆31Aug 3, 2023Updated 2 years ago
- Mistral7B playing DOOM☆29Mar 27, 2024Updated last year