google-research / wide-sparse-netsLinks

☆19

Alternatives and similar repositories for wide-sparse-nets

Users that are interested in wide-sparse-nets are comparing it to the libraries listed below

Sorting:

minhtannguyen / SRSGD
Code base for SRSGD.
☆28Updated 5 years ago
BayesWatch / pytorch-blockswap
Code for BlockSwap (ICLR 2020).
☆33Updated 4 years ago
ganguli-lab / degrees-of-freedom
☆37Updated 3 years ago
mpezeshki / Gradient_Starvation
Gradient Starvation: A Learning Proclivity in Neural Networks
☆61Updated 4 years ago
JiJingYu / delta_orthogonal_init_pytorch
Delta Orthogonal Initialization for PyTorch
☆18Updated 7 years ago
fmi-basel / neural-tangent-transfer
Code accompanying our paper "Finding trainable sparse networks through Neural Tangent Transfer" to be published at ICML-2020.
☆13Updated 5 years ago
MadryLab / dataset-replication-analysis
☆25Updated 5 years ago
evcu / pytorchpruner
☆22Updated 7 years ago
singlasahil14 / barlow
Code for the CVPR 2021 paper: Understanding Failures of Deep Networks via Robust Feature Extraction
☆36Updated 3 years ago
JingtongSu / sanity-checking-pruning
Code for Sanity-Checking Pruning Methods: Random Tickets can Win the Jackpot
☆41Updated 5 years ago
liamcli / darts_asha
Code release to reproduce ASHA experiments from "Random Search and Reproducibility for NAS."
☆22Updated 6 years ago
HazyResearch / augmentation_code
Reproducible code for Augmentation paper
☆17Updated 6 years ago
RAIVNLab / supsup
Code for "Supermasks in Superposition"
☆124Updated 2 years ago
hlml / fortuitous_forgetting
☆19Updated 3 years ago
IST-DASLab / WoodFisher
Code accompanying the NeurIPS 2020 paper: WoodFisher (Singh & Alistarh, 2020)
☆53Updated 4 years ago
hukkai / rescaling
[NeurIPS 2020 Oral] Is normalization indispensable for training deep neural networks?
☆34Updated 3 years ago
JingzhaoZhang / why-clipping-accelerates
A pytorch implementation for the LSTM experiments in the paper: Why Gradient Clipping Accelerates Training: A Theoretical Justification f…
☆46Updated 5 years ago
js-d / sim_metric
☆37Updated 2 years ago
yanivbl6 / quantized_meanfield
This repository provides code source used in the paper: A Mean Field Theory of Quantized Deep Networks: The Quantization-Depth Trade-Off
☆13Updated 6 years ago
dydjw9 / Efficient_SAM
☆58Updated 2 years ago
facebookresearch / fisher_information_loss
This code reproduces the results of the paper, "Measuring Data Leakage in Machine-Learning Models with Fisher Information"
☆50Updated 4 years ago
VITA-Group / CV_LTH_Pre-training
[CVPR 2021] "The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models" Tianlong Chen, Jon…
☆68Updated 2 years ago
lucidrains / kronecker-attention-pytorch
Implementation of Kronecker Attention in Pytorch
☆19Updated 5 years ago
lottery-ticket / rewinding-iclr20-public
☆69Updated 5 years ago
izmailovpavel / torch_swa_examples
☆47Updated 4 years ago
sayakpaul / Training-BatchNorm-and-Only-BatchNorm
Experiments with the ideas presented in https://arxiv.org/abs/2003.00152 by Frankle et al.
☆29Updated 5 years ago
hongyanz / TRADES-smoothing
[JMLR] TRADES + random smoothing for certifiable robustness
☆14Updated 5 years ago
wronnyhuang / gen-viz
Code for the paper "Understanding Generalization through Visualizations"
☆64Updated 4 years ago
yk / PyTorch_CIFAR10
Pretrained TorchVision models on CIFAR10 dataset (with weights)
☆24Updated 5 years ago
yukimasano / linear-probes
Evaluating AlexNet features at various depths
☆40Updated 5 years ago