An implementation of the DISP-LLM method from the NeurIPS 2024 paper: Dimension-Independent Structural Pruning for Large Language Models.
☆25Aug 6, 2025Updated 7 months ago
Alternatives and similar repositories for DISP-LLM-Dimension-Independent-Structural-Pruning
Users that are interested in DISP-LLM-Dimension-Independent-Structural-Pruning are comparing it to the libraries listed below.
- ☆57Jun 10, 2024Updated last year
- ☆35May 24, 2024Updated last year
- Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)☆67Mar 27, 2025Updated last year
- Performs a faster tensor train (TT) decomposition for large sparse data☆14Sep 7, 2020Updated 5 years ago
- DiTAS: Quantizing Diffusion Transformers via Enhanced Activation Smoothing (WACV 2025)☆13Feb 7, 2026Updated last month
- Implementation of NM sparsity recipe presented in the paper "Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers".☆11Feb 5, 2024Updated 2 years ago
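- Several classic examples in AI, ML, DA, DV, and related areas, including a traffic jam simulation (NagelSchreckenberg), a Monte Carlo queuing simulation (Monte Carlo Queuing Problem), face recognition (RecognitionFace), and image inference with a genetic algorithm (IconGenetic)☆10Oct 14, 2018Updated 7 years ago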
- vortex particles for simulating smoke in 2d☆16Dec 13, 2021Updated 4 years ago
- ☆11Aug 2, 2024Updated last year
- ☆20Nov 26, 2025Updated 4 months ago
- PyTorch implementation of paper: Small Pre-trained Language Models Can be Fine-tuned as Large Models via Over-Parameterization.☆12May 18, 2023Updated 2 years ago
- GitHub Repo for OATS: Outlier-Aware Pruning through Sparse and Low Rank Decomposition☆18Apr 16, 2025Updated 11 months ago
- ☆11Jun 5, 2024Updated last year
- This is the official repo for "Differentiable Model Scaling using Differentiable Topk"☆12May 16, 2024Updated last year
- GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models☆19Jul 12, 2023Updated 2 years ago
- Official repository for the paper: "Trees with Attention for Set Prediction Tasks" (ICML21)☆10Jan 19, 2022Updated 4 years ago
- Zeroth-order Min-max Optimization☆13Jun 28, 2020Updated 5 years ago
- Code for Learned Thresholds Token Merging and Pruning for Vision Transformers (LTMP). A technique to reduce the size of Vision Transforme…☆17Nov 24, 2024Updated last year
- This repository contains code for the MicroAdam paper.☆21Dec 14, 2024Updated last year
- ☆27Mar 29, 2025Updated last year
- Accelerating DNN inference and training on Zynq☆16Jul 22, 2020Updated 5 years ago
- Pipelined Processor which implements RV32i Instruction Set. Also contains pipelined L1 4-way set-associative Instruction Cache, direct-ma…☆14Dec 23, 2022Updated 3 years ago
- A TensorFlow [2.0] implementation of ProSeNet: "Interpretable and Steerable Sequence Learning via Prototypes" (Ming et al., 2019)☆12Dec 19, 2019Updated 6 years ago
- Code for the paper - ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning☆22Aug 13, 2024Updated last year
- [ICML 2025] SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models☆55Aug 9, 2024Updated last year
- ☆15Dec 19, 2023Updated 2 years ago
- ☆23Nov 26, 2024Updated last year
- [AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models☆72Jan 6, 2024Updated 2 years ago
- Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304☆14Oct 11, 2022Updated 3 years ago
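- Lab assignments for the 2021 Parallel Programming course taught by Zheng Qilong at USTC (University of Science and Technology of China)☆11Jan 15, 2022Updated 4 years ago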
- Open-source of MSD framework☆16Sep 12, 2023Updated 2 years ago
- Unofficial implementations of block/layer-wise pruning methods for LLMs.☆78Apr 29, 2024Updated last year
- ☆49Mar 20, 2026Updated last week
- Official PyTorch Implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity"☆81Jul 7, 2025Updated 8 months ago
- A Generative Adversarial Network (GAN) trained on the MNIST dataset, capable of creating fake but realistic looking MNIST digit images t…☆13Aug 30, 2023Updated 2 years ago
- Code release for paper ''DNF: Diffractive Neural Field for Lensless Microscopic Imaging''☆18Mar 14, 2024Updated 2 years ago
- ☆14May 7, 2019Updated 6 years ago
- ☆15Apr 29, 2025Updated 11 months ago
- A hybrid data- and physics-augmented CNN that predicts EM field distributions with ultrafast speed and high accuracy for entire classes o…☆22Nov 11, 2022Updated 3 years ago