Accurate Retraining-free Pruning for Pretrained Encoder-based Language Models (ICLR 2024)
☆14May 31, 2025Updated last year
Alternatives and similar repositories for K-prune
Users that are interested in K-prune are comparing it to the libraries listed below.
- Edge-guided Model Inversion for Accurate Data-Free Applications☆22Nov 13, 2025Updated 7 months ago
- [ICML 2024] Sparse Model Inversion: Efficient Inversion of Vision Transformers with Less Hallucination☆14Apr 29, 2025Updated last year
- ☆33Jul 8, 2024Updated last year
- Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic☆32Feb 18, 2026Updated 4 months ago
- [CVPR '24] Official implementation of the paper "Multiflow: Shifting Towards Task-Agnostic Vision-Language Pruning".☆23Mar 7, 2025Updated last year
- MuLe: Multi-Grained Graph Learning for Multi-Behavior Recommendation (CIKM 2024)☆14Dec 21, 2024Updated last year
- Pytorch implementation of our paper accepted by ECCV 2022-- Fine-grained Data Distribution Alignment for Post-Training Quantization☆16Sep 13, 2022Updated 3 years ago
- Learnable Semi-structured Sparsity for Vision Transformers and Diffusion Transformers☆15Feb 7, 2025Updated last year
- DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling☆37Jul 12, 2024Updated last year
- torch_quantizer is an out-of-the-box quantization tool for PyTorch models on CUDA backend, specially optimized for Diffusion Models.☆25Mar 29, 2024Updated 2 years ago
- [CVPR'24] Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression☆15Jul 1, 2024Updated last year
- Representation Surgery for Multi-Task Model Merging. ICML, 2024.☆48Oct 10, 2024Updated last year
- ☆15Mar 20, 2024Updated 2 years ago
- [ICLR 2024] Hebbian Learning based Orthogonal Projection for Continual Learning of Spiking Neural Networks☆46Feb 20, 2024Updated 2 years ago
- ☆45Jun 25, 2025Updated 11 months ago
- M-Zoom: Fast Dense-Block Detection in Tensors with Quality Guarantees (ECML/PKDD'16 & TKDD'18)☆27Oct 30, 2024Updated last year
- Hands-on tutorial about link analysis techniques☆39Nov 10, 2020Updated 5 years ago
- Published as a conference paper at ICLR 2024☆28Feb 26, 2024Updated 2 years ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆82Mar 1, 2025Updated last year
- [AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models☆76Jan 6, 2024Updated 2 years ago
- ☆33Jan 7, 2025Updated last year
- Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)☆69Mar 27, 2025Updated last year
- Library for pruning experts per language pair in NLLB-200☆34Jul 7, 2023Updated 2 years ago
- D-Cube: Dense-Block Detection in Terabyte-Scale Tensors (WSDM'17 & Frontiers in Big Data'21)☆32Oct 30, 2024Updated last year
- Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024)☆15Sep 28, 2024Updated last year
- ☆10Feb 12, 2024Updated 2 years ago
- [ACL'24, Outstanding Paper] Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!☆39Aug 2, 2024Updated last year
- [AAAI 2025] PAT: Pruning-Aware Tuning for Large Language Models☆37Feb 1, 2025Updated last year
- [ICLR 2025] Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better☆16Feb 15, 2025Updated last year
- Task Singular Vectors: Reducing Task Interference in Model Merging. Merge models avoiding task interference through separable models.☆55Dec 15, 2025Updated 6 months ago
- AMES: Asymmetric and Memory-Efficient Similarity☆48Aug 12, 2025Updated 10 months ago
- Official code for paper: [CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster.☆114Jun 29, 2025Updated 11 months ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆110Oct 28, 2024Updated last year
- [ACM MM'2024] Official repository for "Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval"☆43Dec 23, 2024Updated last year
- Easy-to-use framework for graph continual learning with Python☆36Oct 10, 2024Updated last year
- ☆11Oct 22, 2024Updated last year
- MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation☆18Sep 2, 2024Updated last year
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆36Oct 3, 2025Updated 8 months ago
- Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"☆14Mar 28, 2024Updated 2 years ago