An implementation of the DISP-LLM method from the NeurIPS 2024 paper: Dimension-Independent Structural Pruning for Large Language Models.
☆24Aug 6, 2025Updated 7 months ago
Alternatives and similar repositories for DISP-LLM-Dimension-Independent-Structural-Pruning
Users that are interested in DISP-LLM-Dimension-Independent-Structural-Pruning are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024] Search for Efficient LLMs☆16Jan 16, 2025Updated last year
- 关于AI,ML,DA,DV等的几个经典案例,包括堵车模拟(NagelSchreckenberg)、蒙特卡洛排队问题(Monte Carlo Queuing Problem)、人脸识别(RecognitionFace)、遗传算法推断图像(IconGenetic)☆10Oct 14, 2018Updated 7 years ago
- Learning Deep Disentangled Embeddings with the F-Statistic Loss (NIPS 2018)☆10Oct 17, 2018Updated 7 years ago
- ☆11Jun 5, 2024Updated last year
- Implementation of NM sparsity recipe presented in the paper "Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers".☆11Feb 5, 2024Updated 2 years ago
- Code for the paper "Optimal Off-Policy Evaluation from Multiple Logging Policies"☆15Jul 17, 2021Updated 4 years ago
- ☆10Nov 27, 2024Updated last year
- ☆20Nov 26, 2025Updated 3 months ago
- Performs a faster tensor train (TT) decomposition for large sparse data☆14Sep 7, 2020Updated 5 years ago
- [EMNLP 24] Source code for paper 'AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tu…☆12Dec 15, 2024Updated last year
- Zeroth-order Min-max Optimization☆13Jun 28, 2020Updated 5 years ago
- Github Repo for OATS: Outlier-Aware Pruning through Sparse and Low Rank Decomposition☆18Apr 16, 2025Updated 10 months ago
- 中科大郑启龙2021年并行程序设计课程实验☆11Jan 15, 2022Updated 4 years ago
- HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. 🚀 The official implementation of https://arx…☆29Feb 17, 2025Updated last year
- Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304☆14Oct 11, 2022Updated 3 years ago
- Code for "Learning Deep Features in Instrumental Variable Regression" (https://arxiv.org/abs/2010.07154)☆16Sep 16, 2024Updated last year
- This is the official repo for "Differentiable Model Scaling using Differentiable Topk"☆12May 16, 2024Updated last year
- Code for Estimating Multi-cause Treatment Effects via Single-cause Perturbation (NeurIPS 2021)☆14Jan 5, 2022Updated 4 years ago
- [AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models☆70Jan 6, 2024Updated 2 years ago
- Official Code of The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks[ICML2022]☆17Sep 20, 2022Updated 3 years ago
- ☆20Jan 25, 2023Updated 3 years ago
- ZOSVRG-BlackBox-Adv☆13Oct 30, 2018Updated 7 years ago
- Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)☆67Mar 27, 2025Updated 11 months ago
- Training with Block Minifloat number representation☆18May 2, 2021Updated 4 years ago
- Accelerating DNN inference and training on Zynq☆16Jul 22, 2020Updated 5 years ago
- ☆27Mar 29, 2025Updated 11 months ago
- Learning adapter weights from task descriptions☆19Nov 12, 2023Updated 2 years ago
- This repository contains code for the MicroAdam paper.☆21Dec 14, 2024Updated last year
- ☆14May 7, 2019Updated 6 years ago
- Official Pytorch Implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity"☆81Jul 7, 2025Updated 8 months ago
- (ICML2022) Off-Policy Evaluation for Large Action Spaces via Embeddings☆21Jul 27, 2022Updated 3 years ago
- ☆23Nov 26, 2024Updated last year
- pytorch implementation of Structured Bayesian Pruning☆19Jul 13, 2018Updated 7 years ago
- Code for paper "Estimating Causal Effects on Networked Observational Data via Representation Learning"☆20May 28, 2023Updated 2 years ago
- [ICDE 2023] Dynamic hypergraph structure learning for traffic flow forecasting☆21Oct 14, 2022Updated 3 years ago
- ☆23Nov 24, 2018Updated 7 years ago
- ThinK: Thinner Key Cache by Query-Driven Pruning☆27Feb 11, 2025Updated last year
- Code for the paper - ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning☆22Aug 13, 2024Updated last year
- Code for "Interpretable image classification with differentiable prototypes assignment", ECCV 2022☆28Nov 23, 2022Updated 3 years ago