brianlan / complex-grad-norm
A more complex GradNorm use case, with a sophisticated layer-sharing setup.
☆15 · Updated 6 years ago
Alternatives and similar repositories for complex-grad-norm:
Users who are interested in complex-grad-norm are comparing it to the repositories listed below
- ICML'20: SIGUA: Forgetting May Make Learning with Noisy Labels More Robust ☆15 · Updated 4 years ago
- NeurIPS'19: Meta-Weight-Net: Learning an Explicit Mapping For Sample Weighting (PyTorch implementation for class imbalance). ☆33 · Updated 5 years ago
- Implementation of the article "Adapting Auxiliary Losses Using Gradient Similarity" ☆32 · Updated 6 years ago
- Code for the paper Adversarial Robustness via Adversarial Label-Smoothing ☆12 · Updated 5 years ago
- [NeurIPS 2021] “Improving Contrastive Learning on Imbalanced Data via Open-World Sampling”, Ziyu Jiang, Tianlong Chen, Ting Chen, Zhangya… ☆28 · Updated 3 years ago
- PyTorch implementation of our paper accepted by IEEE TNNLS, 2022: Distilling a Powerful Student Model via Online Knowledge Distillation ☆28 · Updated 3 years ago
- Adaptive Activation Network and Functional Regularization for Efficient and Flexible Deep Multi-Task Learning ☆17 · Updated 5 years ago
- Code for DATA: Differentiable ArchiTecture Approximation. ☆11 · Updated 3 years ago
- IJCAI 2021, "Comparing Kullback-Leibler Divergence and Mean Squared Error Loss in Knowledge Distillation" ☆40 · Updated 2 years ago
- Code for the paper "Addressing Model Vulnerability to Distributional Shifts over Image Transformation Sets", ICCV 2019 ☆27 · Updated 5 years ago
- Code for Active Mixup (CVPR 2020) ☆22 · Updated 3 years ago
- Paper and Code for "Curriculum Learning by Optimizing Learning Dynamics" (AISTATS 2021) ☆19 · Updated 3 years ago
- A PyTorch implementation of our proposed loss function from the paper "SimLoss: Class Similarities in Cross Entropy" ☆25 · Updated 3 years ago
- Code release for NeurIPS 2020 paper "Stochastic Normalization" ☆23 · Updated 2 years ago
- WeightNet: Revisiting the Design Space of Weight Networks ☆19 · Updated 4 years ago
- This is an official implementation of our CVPR 2020 paper "Non-Local Neural Networks With Grouped Bilinear Attentional Transforms". ☆12 · Updated 4 years ago
- Code for the ICML 2021 paper "Sharing Less is More: Lifelong Learning in Deep Networks with Selective Layer Transfer" ☆11 · Updated 3 years ago
- Energy-based Out-of-distribution Detection ☆15 · Updated 4 years ago
- Categorical variational autoencoder using the Gumbel-Softmax estimator ☆26 · Updated 6 years ago
- Distilling knowledge from an ensemble of multiple teacher networks to a student network with multiple heads ☆7 · Updated 3 years ago
- Ranking-based-Instance-Selection ☆32 · Updated 3 years ago
- Deep Metric Transfer for Label Propagation with Limited Annotated Data ☆49 · Updated last year
- A PyTorch implementation for Unsupervised Data Augmentation ☆23 · Updated 2 years ago
- Q. Yao, H. Yang, B. Han, G. Niu, J. Kwok. Searching to Exploit Memorization Effect in Learning from Noisy Labels. ICML 2020 ☆22 · Updated 4 years ago