haiquanlu / AlphaPruning
[NeurIPS 2024] AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models
☆23Updated last month
Alternatives and similar repositories for AlphaPruning
Users that are interested in AlphaPruning are comparing it to the libraries listed below
Sorting:
- Official Pytorch Implementation of Our Paper Accepted at ICLR 2024-- Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLM…☆47Updated last year
- [ICML'24 Oral] APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference☆39Updated 11 months ago
- Elucidated Dataset Condensation (NeurIPS 2024)☆21Updated 7 months ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆58Updated 2 months ago
- Data distillation benchmark☆58Updated this week
- ☆18Updated 5 months ago
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"☆21Updated 8 months ago
- ☆50Updated 4 months ago
- Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"☆24Updated 11 months ago
- Official Pytorch Implementation of "OwLore: Outlier-weighed Layerwise Sampled Low-Rank Projection for Memory-Efficient LLM Fine-tuning" b…☆31Updated 11 months ago
- [ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models☆20Updated 11 months ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆78Updated 6 months ago
- ICLR 2024, Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching☆102Updated 11 months ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆44Updated 6 months ago
- [ICLR 2025] The official pytorch implement of "Dynamic Low-Rank Sparse Adaptation for Large Language Models".☆16Updated last month
- Awesome-Low-Rank-Adaptation☆95Updated 7 months ago
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training☆34Updated last month
- ☆14Updated 2 years ago
- Official implementation for LaCo (EMNLP 2024 Findings)☆16Updated 7 months ago
- A block pruning framework for LLMs.☆22Updated 10 months ago
- Github Repo for OATS: Outlier-Aware Pruning through Sparse and Low Rank Decomposition☆12Updated 3 weeks ago
- ☆51Updated last year
- [CVPR 2024] On the Diversity and Realism of Distilled Dataset: An Efficient Dataset Distillation Paradigm☆69Updated 2 months ago
- (NeurIPS 2023 spotlight) Large-scale Dataset Distillation/Condensation, 50 IPC (Images Per Class) achieves the highest 60.8% on original …☆128Updated 5 months ago
- [CVPR2024] Efficient Dataset Distillation via Minimax Diffusion☆90Updated last year
- This repo contains the source code for VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks (NeurIPS 2024).☆37Updated 6 months ago
- ☆10Updated 3 months ago
- Official Implementation of paper "Distilling Long-tailed Datasets"☆13Updated 2 months ago
- BESA is a differentiable weight pruning technique for large language models.☆16Updated last year
- A curated list of Model Merging methods.☆92Updated 7 months ago