VITA-Group/ToST

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/VITA-Group/ToST)

VITA-Group / ToST

[ICML2022] Training Your Sparse Neural Network Better with Any Mask. Ajay Jaiswal, Haoyu Ma, Tianlong Chen, ying Ding, and Zhangyang Wang

☆30

Alternatives and similar repositories for ToST

Users that are interested in ToST are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

namhoonlee / spp-public
View on GitHub
A Signal Propagation Perspective for Pruning Neural Networks at Initialization
☆14Jun 23, 2020Updated 6 years ago
hoonyyhoon / Synflow_SNIP_GraSP
View on GitHub
Comparison of method "Pruning at initialization prior to training" (Synflow/SNIP/GraSP) in PyTorch
☆18May 12, 2024Updated 2 years ago
ASU-ESIC-FAN-Lab / RepNet
View on GitHub
☆13Jul 3, 2025Updated last year
alecwangcq / GraSP
View on GitHub
Code for "Picking Winning Tickets Before Training by Preserving Gradient Flow" https://openreview.net/pdf?id=SkgsACVKPH
☆105Feb 18, 2020Updated 6 years ago
VITA-Group / llm-kick
View on GitHub
[ICLR 2024] Jaiswal, A., Gan, Z., Du, X., Zhang, B., Wang, Z., & Yang, Y. Compressing llms: The truth is rarely pure and never simple.
☆27Apr 21, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
x-zho14 / ProbMask-official
View on GitHub
Implementation of Effective Sparsification of Neural Networks with Global Sparsity Constraint
☆31Mar 24, 2022Updated 4 years ago
VITA-Group / ramanujan-on-pai
View on GitHub
[ICLR 2023] 'Revisiting Pruning At Initialization Through The Lens of Ramanujan Graph" by Duc Hoang, Shiwei Liu, Radu Marculescu, Atlas W…
☆14Aug 4, 2023Updated 2 years ago
LOG-postech / ZIP
View on GitHub
☆18Nov 10, 2025Updated 8 months ago
VITA-Group / Junk_DNA_Hypothesis
View on GitHub
[ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…
☆16Apr 21, 2025Updated last year
MehdiSet / PerFedMask
View on GitHub
☆16Feb 28, 2023Updated 3 years ago
VITA-Group / FreeTickets
View on GitHub
[ICLR 2022] "Deep Ensembling with No Overhead for either Training or Testing: The All-Round Blessings of Dynamic Sparsity" by Shiwei Liu,…
☆27Jun 15, 2022Updated 4 years ago
yanghr / DeepHoyer
View on GitHub
DeepHoyer: Learning Sparser Neural Network with Differentiable Scale-Invariant Sparsity Measures
☆32Aug 13, 2020Updated 5 years ago
boone891214 / MEST
View on GitHub
[NeurIPS‘2021] "MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge", Geng Yuan, Xiaolong Ma, Yanzhi Wang et al…
☆17Mar 16, 2022Updated 4 years ago
ganguli-lab / Synaptic-Flow
View on GitHub
☆229Jul 25, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
LOG-postech / rethinking-LLM-pruning
View on GitHub
[EMNLP 2024] Official implementation of "Rethinking Pruning Large Language Models: Benefits and Pitfalls of Reconstruction Error Minimiza…
☆28Feb 21, 2025Updated last year
MingSun-Tse / Awesome-Pruning-at-Initialization
View on GitHub
[IJCAI'22 Survey] Recent Advances on Neural Network Pruning at Initialization.
☆59Oct 10, 2023Updated 2 years ago
VITA-Group / GraNet
View on GitHub
[Neurips 2021] Sparse Training via Boosting Pruning Plasticity with Neuroregeneration
☆31Feb 11, 2023Updated 3 years ago
cyz-ai / neural-approx-ss-lfi
View on GitHub
Codes for ICLR 21 paper: Neural Approximate Sufficient Statistics for Implicit Models
☆20Jun 23, 2022Updated 4 years ago
LOG-postech / Sassha
View on GitHub
[ICML 2025] Official Pytorch code for "SASSHA: Sharpness-aware Adaptive Second-order Optimization With Stable Hessian Approximation"
☆24Aug 11, 2025Updated 11 months ago
yuf11235 / python-opencv-eye_trail
View on GitHub
Python+OpenCV实现眼动追踪
☆14Mar 19, 2020Updated 6 years ago
ictnlp / LNMT-CA
View on GitHub
Code for EMNLP 2022 main conference paper "Low-resource Neural Machine Translation with Cross-modal Alignment".
☆15Apr 25, 2023Updated 3 years ago
guide2157 / ChulaXrayClassifier
View on GitHub
Xray Image Classifier in collaboration with Chulalongkorn University Computational Molecular Biology Research Group
☆12Aug 18, 2020Updated 5 years ago
aispeech-lab / TinyWASE
View on GitHub
PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…
☆11Jun 28, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
GhadaSokar / Dynamic-Sparse-Training-for-Deep-Reinforcement-Learning
View on GitHub
[IJCAI 2022] "Dynamic Sparse Training for Deep Reinforcement Learning" by Ghada Sokar, Elena Mocanu , Decebal Constantin Mocanu, Mykola P…
☆15May 13, 2022Updated 4 years ago
songmzhang / CBMI
View on GitHub
The code of ACL2022 paper "Conditional Bilingual Mutual Information based Adaptive Training for Neural Machine Translation"..
☆14Aug 6, 2022Updated 3 years ago
yuwvandy / Awesome-scaleGNN
View on GitHub
☆10Sep 16, 2021Updated 4 years ago
MadhumithaKannan / linear-regression-using-only-numpy
View on GitHub
Implementation of unregularized, l1 regularized and l2 regularized linear regression using numpy and without sklearn
☆11Oct 4, 2019Updated 6 years ago
miaozhang0525 / iDARTS
View on GitHub
codes for ICML2021 paper iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients
☆10May 27, 2021Updated 5 years ago
fastconvnets / cvpr2020
View on GitHub
Code for "Fast Sparse ConvNets" CVPR2020 submissions
☆12Nov 20, 2019Updated 6 years ago
Yaoming95 / CIAT
View on GitHub
code repo for EMNLP'21 Finding Counter-Interference Adapter for Multilingual Machine Translation
☆18Oct 19, 2022Updated 3 years ago
VinayTeki / Semantic_Segmentation
View on GitHub
KERAS: Multimodal Deep Learning for Semantic Segmentation (RGB, NIR Streams) - multiple architectures
☆11Jun 19, 2017Updated 9 years ago
lisasiyu / Cross-Align
View on GitHub
EMNLP2022 "Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment"
☆20Feb 19, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
gallego-posada / constrained_sparsity
View on GitHub
Official implementation for the paper "Controlled Sparsity via Constrained Optimization"
☆12Aug 10, 2022Updated 3 years ago
Adaxry / ss_on_decoding_steps.
View on GitHub
codes for "Scheduled Sampling Based on Decoding Steps for Neural Machine Translation" (long paper of EMNLP-2022)
☆20Aug 31, 2021Updated 4 years ago
xyjun / large-scale-GNN
View on GitHub
这项目主要收集大规模GNN（图神经网络）的相关研究
☆10May 26, 2020Updated 6 years ago
tedzhouhk / GCNP
View on GitHub
☆16Dec 8, 2021Updated 4 years ago
davidstutz / ipiano
View on GitHub
Implementation of the iPiano algorithm for non-convex and non-smooth optimization as described in [1].
☆12Nov 28, 2018Updated 7 years ago
mil-ad / snip
View on GitHub
Pytorch implementation of the paper "SNIP: Single-shot Network Pruning based on Connection Sensitivity" by Lee et al.
☆110Apr 23, 2019Updated 7 years ago
xidongwu / AutoTrainOnce
View on GitHub
☆21Oct 1, 2024Updated last year