takashiishida / floodingLinks
[ICML 2020] code for the flooding regularizer proposed in "Do We Need Zero Training Loss After Achieving Zero Training Error?"
☆92Updated 2 years ago
Alternatives and similar repositories for flooding
Users that are interested in flooding are comparing it to the libraries listed below
Sorting:
- Official implementation of Auxiliary Learning by Implicit Differentiation [ICLR 2021]☆84Updated last year
- [ICML 2021 Oral] We show pure attention suffers rank collapse, and how different mechanisms combat it.☆165Updated 4 years ago
- PyTorch Implementations of Dropout Variants☆87Updated 7 years ago
- [ICML 2020] code for "PowerNorm: Rethinking Batch Normalization in Transformers" https://arxiv.org/abs/2003.07845☆120Updated 4 years ago
- implements optimal transport algorithms in pytorch☆100Updated 3 years ago
- NeurIPS 2020, Debiased Contrastive Learning☆283Updated 2 years ago
- Official PyTorch implementation of the paper "Self-Supervised Relational Reasoning for Representation Learning", NeurIPS 2020 Spotlight.☆143Updated last year
- ICML 2019: Understanding and Utilizing Deep Neural Networks Trained with Noisy Labels☆91Updated 4 years ago
- Loss and accuracy go opposite ways...right?☆95Updated 5 years ago
- Official adversarial mixup resynthesis repository☆35Updated 5 years ago
- Code for Multi-Head Attention: Collaborate Instead of Concatenate☆152Updated 2 years ago
- Hyperspherical Prototype Networks☆67Updated 5 years ago
- Implementation for our WACV 2021 paper "Multi-Loss Weighting with Coefficient of Variations"☆50Updated 4 years ago
- Implementation of Sparsemax activation in Pytorch☆162Updated 5 years ago
- Pytorch implementation of the hamburger module from the ICLR 2021 paper "Is Attention Better Than Matrix Decomposition"☆99Updated 4 years ago
- MTAdam: Automatic Balancing of Multiple Training Loss Terms☆36Updated 4 years ago
- MODALS: Modality-agnostic Automated Data Augmentation in the Latent Space☆41Updated 4 years ago
- A PyTorch implementation of the paper - "Synthesizer: Rethinking Self-Attention in Transformer Models"☆73Updated 2 years ago
- Unsupervised Data Augmentation experiments in PyTorch☆60Updated 6 years ago
- Evaluating AlexNet features at various depths☆40Updated 4 years ago
- Reparameterize your PyTorch modules☆71Updated 4 years ago
- A pytorch implementation for the LSTM experiments in the paper: Why Gradient Clipping Accelerates Training: A Theoretical Justification f…☆46Updated 5 years ago
- Full implementation of the paper "Rethinking Softmax with Cross-Entropy: Neural Network Classifier as Mutual Information Estimator".☆102Updated 5 years ago
- Tensorflow implementation of "Learning to Balance: Bayesian Meta-learning for Imbalanced and Out-of-distribution Tasks" (ICLR 2020 oral)☆101Updated 4 years ago
- pytorch implementation of manifold-mixup☆22Updated 2 years ago
- ☆81Updated last year
- [NeurIPS'20] GradAug: A New Regularization Method for Deep Neural Networks☆94Updated 4 years ago
- PyTorch Examples repo for "ReZero is All You Need: Fast Convergence at Large Depth"☆61Updated last year
- Tensorflow implementation of "Meta Dropout: Learning to Perturb Latent Features for Generalization" (ICLR 2020)☆27Updated 5 years ago
- pytorch implementation of VAE-Gumble-Softmax☆63Updated 5 years ago