snudm-starlab / K-prune
Accurate Retraining-free Pruning for Pretrained Encoder-based Language Models (ICLR 2024)
☆11Updated last month
Alternatives and similar repositories for K-prune:
Users who are interested in K-prune are comparing it to the repositories listed below
- SynQ: Accurate Zero-shot Quantization by Synthesis-aware Fine-tuning (ICLR 2025)☆23Updated last month
- Structured pruning algorithm for Transformer models☆31Updated last year
- SensiMix: Sensitivity-Aware 8-bit Index & 1-bit Value Mixed Precision Quantization for BERT Compression (PLOS One)☆34Updated 3 years ago
- PET: Parameter-efficient Knowledge Distillation on Transformer (PLOS One)☆15Updated last year
- Falcon: Lightweight and Accurate Convolution Based on Depthwise Separable Convolution (KAIS)☆44Updated 8 months ago
- Pea-KD: Parameter-efficient and accurate knowledge distillation on BERT (PLOS One)☆35Updated 2 years ago
- Flexible Convolutional Neural Network☆22Updated last year
- ☆33Updated 2 years ago
- Vector multiplication on Low-rank Matrix Factorization☆46Updated last year
- ☆83Updated last year
- Model-Agnostic Augmentation for Accurate Graph Classification (WWW 2022)☆20Updated 2 years ago
- ☆56Updated 2 years ago
- A dataset repository of "Accurate Action Recommendation for Smart Home via Two-Level Encoders and Commonsense Knowledge" (CIKM 2022)☆14Updated 2 years ago
- ☆11Updated last year
- Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Model…☆57Updated last year
- ☆25Updated last month
- AAAI 2022 accepted paper, NaturalInversion: Data-Free Image Synthesis Improving Real-World Consistency☆10Updated 3 years ago
- Python Implementation for Signed Random Walk with Restart (SRWR)☆8Updated 5 years ago
- ☆51Updated 4 months ago
- Easy-to-use framework for graph continual learning with Python☆34Updated 5 months ago
- Pipeline for employing Lightweight deep learning models for LOW-power systems☆11Updated 2 years ago
- Official Implementation of "Genie: Show Me the Data for Quantization" (CVPR 2023)☆18Updated last year
- ☆50Updated last year
- ☆102Updated last year
- [ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs☆100Updated 3 months ago
- Official code for "Towards Better Utilization of Multiple Views for Bundle Recommendation" (CIKM 24 Short)☆19Updated 3 months ago
- Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)☆52Updated last week
- ☆12Updated 11 months ago
- Structured Neuron Level Pruning to compress Transformer-based models [ECCV'24]☆12Updated 7 months ago
- Technique for explaining atomic factual relations in generated Korean documents☆14Updated 3 months ago