hoya012 / swa-tutorials-pytorch
Stochastic Weight Averaging Tutorials using pytorch.
☆33 · Updated 5 years ago
Alternatives and similar repositories for swa-tutorials-pytorch
Users who are interested in swa-tutorials-pytorch are comparing it to the libraries listed below.
- ☆96 · Updated 4 years ago
- ☆38 · Updated 5 years ago
- Official Implementation of Unweighted Data Subsampling via Influence Function - AAAI 2020 ☆64 · Updated 4 years ago
- Code for Active Mixup in 2020 CVPR ☆23 · Updated 3 years ago
- MODALS: Modality-agnostic Automated Data Augmentation in the Latent Space ☆41 · Updated 4 years ago
- AAAI 2021: Robustness of Accuracy Metric and its Inspirations in Learning with Noisy Labels ☆23 · Updated 4 years ago
- ☆36 · Updated 5 years ago
- Pytorch implementation for "The Surprising Positive Knowledge Transfer in Continual 3D Object Shape Reconstruction" ☆33 · Updated 3 years ago
- ☆109 · Updated 2 years ago
- [ICML 2020] code for the flooding regularizer proposed in "Do We Need Zero Training Loss After Achieving Zero Training Error?" ☆94 · Updated 2 years ago
- Evaluating AlexNet features at various depths ☆40 · Updated 5 years ago
- A pytorch implementation for the LSTM experiments in the paper: Why Gradient Clipping Accelerates Training: A Theoretical Justification f… ☆46 · Updated 5 years ago
- ☆57 · Updated 3 years ago
- Implementation of Online Label Smoothing in PyTorch ☆95 · Updated 3 years ago
- [TPAMI2022 & NeurIPS2020] Official implementation of Self-Adaptive Training ☆130 · Updated 4 years ago
- [ICLR 2021 Spotlight Oral] "Undistillable: Making A Nasty Teacher That CANNOT teach students", Haoyu Ma, Tianlong Chen, Ting-Kuei Hu, Che… ☆82 · Updated 3 years ago
- Tensorflow implementation of "Meta Dropout: Learning to Perturb Latent Features for Generalization" (ICLR 2020) ☆27 · Updated 5 years ago
- [ICML 2020] code for "PowerNorm: Rethinking Batch Normalization in Transformers" https://arxiv.org/abs/2003.07845 ☆120 · Updated 4 years ago
- IJCAI 2021, "Comparing Kullback-Leibler Divergence and Mean Squared Error Loss in Knowledge Distillation" ☆42 · Updated 2 years ago
- Official implementation of Auxiliary Learning by Implicit Differentiation [ICLR 2021] ☆86 · Updated last year
- ☆83 · Updated last year
- Implementation of the paper Identifying Mislabeled Data using the Area Under the Margin Ranking: https://arxiv.org/pdf/2001.10528v2.pdf ☆21 · Updated 5 years ago
- Code for paper "Can contrastive learning avoid shortcut solutions?" NeurIPS 2021. ☆47 · Updated 3 years ago
- Code for the ICML 2021 paper "Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Training and Effective Adaptation", Haoxi… ☆68 · Updated 4 years ago
- Code for "Supermasks in Superposition" ☆124 · Updated 2 years ago
- Gradient Starvation: A Learning Proclivity in Neural Networks ☆61 · Updated 4 years ago
- Implementation of the paper "Understanding anomaly detection with deep invertible networks through hierarchies of distributions and featu… ☆42 · Updated 4 years ago
- ☆31 · Updated 4 years ago
- Explores the ideas presented in Deep Ensembles: A Loss Landscape Perspective (https://arxiv.org/abs/1912.02757) by Stanislav Fort, Huiyi … ☆66 · Updated 5 years ago
- Zero-Shot Knowledge Distillation in Deep Networks ☆67 · Updated 3 years ago