Structural Pruning for LLaMA
☆54May 20, 2023Updated 2 years ago
Alternatives and similar repositories for LLaMA-Pruning
Users that are interested in LLaMA-Pruning are comparing it to the libraries listed below
Sorting:
- ☆58Oct 6, 2023Updated 2 years ago
- BESA is a differentiable weight pruning technique for large language models.☆17Mar 4, 2024Updated 2 years ago
- The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)☆18Mar 21, 2023Updated 2 years ago
- [WACV2025 Oral] DeepMIM: Deep Supervision for Masked Image Modeling☆56May 10, 2025Updated 9 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆92Feb 14, 2025Updated last year
- ☆40Mar 25, 2023Updated 2 years ago
- [NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baich…☆1,106Oct 7, 2024Updated last year
- ☆13May 21, 2023Updated 2 years ago
- (ECCV2022) EAGAN: EAGAN: Efficient Two-stage Evolutionary Architecture Search for GANs☆12Sep 15, 2022Updated 3 years ago
- A tool for extracting plain text and internal Wikipedia links from Wikipedia dumps☆11Apr 18, 2019Updated 6 years ago
- Pytorch、Numpy实现NMS、Soft-NMS代码☆12Mar 22, 2021Updated 4 years ago
- Accepted at IJCAI-2022☆11Sep 3, 2022Updated 3 years ago
- [ACL'22] Training-free Neural Architecture Search for RNNs and Transformers☆14May 26, 2024Updated last year
- Official Code for Dataset Distillation using Neural Feature Regression (NeurIPS 2022)☆48Nov 12, 2022Updated 3 years ago
- [Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.☆34Mar 11, 2025Updated 11 months ago
- ☆12Nov 25, 2021Updated 4 years ago
- Trying to deconstruct RWKV in understandable terms☆14May 6, 2023Updated 2 years ago
- Code Implementation for "NASH: A Simple Unified Framework of Structured Pruning for Accelerating Encoder-Decoder Language Models" (EMNLP …☆17Oct 17, 2023Updated 2 years ago
- An interactive demo based on Segment-Anything for style transfer which enables different content regions apply different styles.☆101Apr 24, 2023Updated 2 years ago
- Code for "Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes"☆30Mar 28, 2024Updated last year
- Official implementation of TransNormerLLM: A Faster and Better LLM☆252Jan 23, 2024Updated 2 years ago
- [NeurIPS 2025] IEAP: Image Editing As Programs with Diffusion Models☆113Sep 27, 2025Updated 5 months ago
- PyTorch implementation of paper "StyDeSty: Min-Max Stylization and Destylization for Single Domain Generalization" in ICML 2024.☆15Jun 4, 2024Updated last year
- Transmute AI Lab Model Efficiency Toolkit☆19Oct 2, 2023Updated 2 years ago
- Code for my ICLR 2024 TinyPapers paper "Prune and Tune: Improving Efficient Pruning Techniques for Massive Language Models"☆16May 26, 2023Updated 2 years ago
- Fork of Flame repo for training of some new stuff in development☆19Feb 20, 2026Updated last week
- Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation☆35Feb 27, 2024Updated 2 years ago
- Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)☆67Mar 27, 2025Updated 11 months ago
- Unofficial implementations of block/layer-wise pruning methods for LLMs.☆77Apr 29, 2024Updated last year
- Towards Meta-Pruning via Optimal Transport, ICLR 2024 (Spotlight)☆18Dec 5, 2024Updated last year
- ☆42Apr 23, 2024Updated last year
- [ECCV 2024] Isomorphic Pruning for Vision Models☆81Jul 23, 2024Updated last year
- ☆18Nov 6, 2019Updated 6 years ago
- Official code for paper "Beyond Sole Strength: Customized Ensembles for Generalized Vision-Language Models, ICML2024"☆27Feb 2, 2025Updated last year
- [MICCAI'22] AutoLaparo: A New Dataset of Integrated Multi-tasks for Image-guided Surgical Automation in Laparoscopic Hysterectomy☆19Mar 21, 2025Updated 11 months ago
- INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues☆58May 29, 2023Updated 2 years ago
- OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM☆310Sep 16, 2024Updated last year
- Dimple, the first Discrete Diffusion Multimodal Large Language Model☆115Jul 9, 2025Updated 7 months ago
- [CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.☆3,262Sep 7, 2025Updated 5 months ago