QingruZhang/PLATON

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/QingruZhang/PLATON)

QingruZhang / PLATON

This pytorch package implements PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance (ICML 2022).

☆45

Alternatives and similar repositories for PLATON

Users that are interested in PLATON are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cliang1453 / SAGE
View on GitHub
No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)
☆29Feb 9, 2022Updated 4 years ago
cliang1453 / super-structured-lottery-tickets
View on GitHub
Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization (ACL 2021)
☆19Jul 28, 2021Updated 4 years ago
yxli2123 / LoftQ
View on GitHub
☆234Jun 11, 2024Updated 2 years ago
WoosukKwon / retraining-free-pruning
View on GitHub
[NeurIPS 2022] A Fast Post-Training Pruning Framework for Transformers
☆197Feb 28, 2023Updated 3 years ago
cliang1453 / task-aware-distillation
View on GitHub
Less is More: Task-aware Layer-wise Distillation for Language Model Compression (ICML2023)
☆40Aug 28, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
VITA-Group / Structure-LTH
View on GitHub
[ICML 2022] "Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets" by Tianlong Chen, Xuxi Chen, Xiaolong Ma, Yanzhi Wa…
☆33Apr 9, 2023Updated 3 years ago
THUNLP-MT / Brote
View on GitHub
☆11Jan 19, 2025Updated last year
yxli2123 / LoSparse
View on GitHub
☆64Oct 17, 2023Updated 2 years ago
junjieliu2910 / DynamicSparseTraining
View on GitHub
[ICLR-2020] Dynamic Sparse Training: Find Efficient Sparse Network From Scratch With Trainable Masked Layers.
☆32Jan 20, 2020Updated 6 years ago
allenhaozhu / COLES
View on GitHub
☆21Dec 6, 2021Updated 4 years ago
Noahs-ARK / RFA
View on GitHub
☆33Apr 12, 2021Updated 5 years ago
huggingface / block_movement_pruning
View on GitHub
Block Sparse movement pruning
☆83Nov 26, 2020Updated 5 years ago
QingruZhang / AdaLoRA
View on GitHub
AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).
☆393Jun 1, 2023Updated 3 years ago
kyegomez / LM-Infinite
View on GitHub
Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
☆40Nov 11, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
shentianxiao / FiLM
View on GitHub
☆13Oct 18, 2023Updated 2 years ago
zhangqi-here / UnifiedEAE
View on GitHub
A Multi-Format Transfer Learning Model for Event Argument Extraction via Variational Information Bottleneck
☆10Sep 9, 2022Updated 3 years ago
Brett-z / LayerEditing
View on GitHub
A Model Agnostic function to directly remove specified layers from the LLM
☆10May 23, 2024Updated 2 years ago
YuxianMeng / CorefQA-pytorch
View on GitHub
A PyTorch implementation of the CorefQA Model.
☆10Jun 27, 2020Updated 6 years ago
shivamsaboo17 / PySNIP
View on GitHub
Single shot neural network pruning before training the model, based on connection sensitivity
☆11Aug 7, 2019Updated 6 years ago
Timothyxxx / KVCachePapers
View on GitHub
☆20May 24, 2024Updated 2 years ago
VITA-Group / SMC-Bench
View on GitHub
[ICLR 2023] "Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!" Shiwei Liu, Tianlong Chen, Zhenyu Zhang, Xuxi Chen…
☆28Aug 29, 2023Updated 2 years ago
merantix / acosp
View on GitHub
Semantic Segmentation in Pytorch
☆10Dec 9, 2022Updated 3 years ago
jaeho-lee / layer-adaptive-sparsity
View on GitHub
In progress.
☆69Mar 26, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
llyx97 / TAMT
View on GitHub
[NAACL 2022] "Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training", Yuanxin Liu, Fandong Meng, Zheng Lin, Pe…
☆15Oct 18, 2022Updated 3 years ago
RunxinXu / ContrastivePruning
View on GitHub
Source code for our AAAI'22 paper 《From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression》
☆25Dec 15, 2021Updated 4 years ago
simonepri / fever-transformers
View on GitHub
📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks
☆12Feb 21, 2020Updated 6 years ago
fuzihaofzh / AnalyzeParameterEfficientFinetune
View on GitHub
On the Effectiveness of Parameter-Efficient Fine-Tuning
☆39Nov 4, 2023Updated 2 years ago
alecwangcq / GraSP
View on GitHub
Code for "Picking Winning Tickets Before Training by Preserving Gradient Flow" https://openreview.net/pdf?id=SkgsACVKPH
☆105Feb 18, 2020Updated 6 years ago
Harvard-CS-2881 / harvard-cs-2881-hw0
View on GitHub
harvard-cs-2881-classroom-hw0-c2881-hw0 created by GitHub Classroom
☆16Jul 26, 2025Updated 11 months ago
justincosentino / robust-sparse-networks
View on GitHub
The Search for Sparse, Robustness Neural Networks
☆11Mar 24, 2023Updated 3 years ago
Bai-YT / AdaptiveSmoothing
View on GitHub
Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".
☆10Feb 6, 2024Updated 2 years ago
ayaabdelsalam91 / saliency_guided_training
View on GitHub
☆13Nov 29, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
VijayLingam95 / SVFT
View on GitHub
☆35Feb 10, 2025Updated last year
mlfoundations / tabliblib
View on GitHub
A Python library for processing and filtering TabLib
☆14Aug 24, 2024Updated last year
Wizardcoast / Linear_Alignment
View on GitHub
This repo is reproduction resources for linear alignment paper, still working
☆17May 19, 2024Updated 2 years ago
VITA-Group / instant_soup
View on GitHub
[ICML2023] Instant Soup Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models. Ajay Jaiswal, Shiwei Liu, Ti…
☆11Nov 28, 2023Updated 2 years ago
KoichiYasuoka / spaCy-Thai
View on GitHub
Dependency parser on Thai language
☆27Jan 25, 2025Updated last year
hikvision-research / SAViT
View on GitHub
☆13Sep 24, 2023Updated 2 years ago
siat-nlp / DDMN
View on GitHub
Code and data for the paper "Dual Dynamic Memory Network for End-to-End Multi-turn Task-oriented Dialog Systems".
☆14Aug 16, 2022Updated 3 years ago