microsoft / fnl_paperLinks

Factorized Neural Layers

☆29

Alternatives and similar repositories for fnl_paper

Users that are interested in fnl_paper are comparing it to the libraries listed below

Sorting:

sjunhongshen / DASH
☆23Updated 2 years ago
facebookresearch / spartan
Spartan is an algorithm for training sparse neural network models. This repository accompanies the paper "Spartan Differentiable Sparsity…
☆24Updated 2 years ago
IST-DASLab / WoodFisher
Code accompanying the NeurIPS 2020 paper: WoodFisher (Singh & Alistarh, 2020)
☆52Updated 4 years ago
HayeonLee / MetaD2A
Official PyTorch implementation of "Rapid Neural Architecture Search by Learning to Generate Graphs from Datasets" (ICLR 2021)
☆64Updated 11 months ago
mil-ad / prospr
Prospect Pruning: Finding Trainable Weights at Initialization Using Meta-Gradients
☆31Updated 3 years ago
naver / force
☆14Updated 4 years ago
JingtongSu / sanity-checking-pruning
Code for Sanity-Checking Pruning Methods: Random Tickets can Win the Jackpot
☆42Updated 4 years ago
shwinshaker / LipGrow
An adaptive training algorithm for residual network
☆15Updated 4 years ago
VITA-Group / Lifelong-Learning-LTH
[ICLR 2021] "Long Live the Lottery: The Existence of Winning Tickets in Lifelong Learning" by Tianlong Chen*, Zhenyu Zhang*, Sijia Liu, S…
☆25Updated 3 years ago
JeanKaddour / LAWA
Latest Weight Averaging (NeurIPS HITY 2022)
☆31Updated 2 years ago
google-research / understanding-transfer-learning
☆45Updated 4 years ago
juntang-zhuang / ACProp-Optimizer
Implementation for ACProp ( Momentum centering and asynchronous update for adaptive gradient methdos, NeurIPS 2021)
☆15Updated 3 years ago
ayulockin / LossLandscape
Explores the ideas presented in Deep Ensembles: A Loss Landscape Perspective (https://arxiv.org/abs/1912.02757) by Stanislav Fort, Huiyi …
☆65Updated 4 years ago
google-research / growneuron
☆55Updated 11 months ago
VITA-Group / ToST
[ICML2022] Training Your Sparse Neural Network Better with Any Mask. Ajay Jaiswal, Haoyu Ma, Tianlong Chen, ying Ding, and Zhangyang Wang
☆28Updated 2 years ago
aoiang / LaMOO
☆29Updated 2 years ago
yaozhewei / MLPruning
MLPruning, PyTorch, NLP, BERT, Structured Pruning
☆20Updated 4 years ago
nick11roberts / XD
☆12Updated 3 years ago
lottery-ticket / rewinding-iclr20-public
☆70Updated 5 years ago
JingzhaoZhang / why-clipping-accelerates
A pytorch implementation for the LSTM experiments in the paper: Why Gradient Clipping Accelerates Training: A Theoretical Justification f…
☆46Updated 5 years ago
samuelstanton / gnosis
Code to reproduce experiments from 'Does Knowledge Distillation Really Work' a paper which appeared in the 2021 NeurIPS proceedings.
☆33Updated last year
briancheung / superposition
☆45Updated 5 years ago
JeanKaddour / NoTrainNoGain
Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)
☆80Updated last year
princeton-nlp / DataMUX
[NeurIPS 2022] DataMUX: Data Multiplexing for Neural Networks
☆60Updated 2 years ago
VITA-Group / Structure-LTH
[ICML 2022] "Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets" by Tianlong Chen, Xuxi Chen, Xiaolong Ma, Yanzhi Wa…
☆33Updated 2 years ago
VITA-Group / SMC-Bench
[ICLR 2023] "Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!" Shiwei Liu, Tianlong Chen, Zhenyu Zhang, Xuxi Chen…
☆28Updated last year
JonasGeiping / dataaugs
☆18Updated 2 years ago
AIoT-MLSys-Lab / CATE
[ICML 2021 Oral] "CATE: Computation-aware Neural Architecture Encoding with Transformers" by Shen Yan, Kaiqiang Song, Fei Liu, Mi Zhang
☆19Updated 4 years ago
jfainberg / hashed_nets
PyTorch implementation of HashedNets
☆36Updated 2 years ago
fKunstner / noise-sgd-adam-sign
☆16Updated 2 years ago