brianlan / complex-grad-normLinks
a much more complex case using GradNorm, where the layer sharing situation is sophisticated.
☆15Updated 6 years ago
Alternatives and similar repositories for complex-grad-norm
Users that are interested in complex-grad-norm are comparing it to the libraries listed below
Sorting:
- Adaptive Activation Network and Functional Regularization for Efficient and Flexible Deep Multi-Task Learning☆17Updated 5 years ago
- NeurIPS'19: Meta-Weight-Net: Learning an Explicit Mapping For Sample Weighting (Pytorch implementation for class imbalance).☆33Updated 5 years ago
- Implementation "Adapting Auxiliary Losses Using Gradient Similarity" article☆32Updated 6 years ago
- ICML'20: SIGUA: Forgetting May Make Learning with Noisy Labels More Robust☆15Updated 4 years ago
- The implementation of paper ''Efficient Attention Network: Accelerate Attention by Searching Where to Plug''.☆20Updated last year
- Code release for NeurIPS 2020 paper "Stochastic Normalization"☆23Updated 3 years ago
- Ranking-based-Instance-Selection☆32Updated 3 years ago
- ☆23Updated 4 years ago
- Codes for DATA: Differentiable ArchiTecture Approximation.☆11Updated 3 years ago
- Code for the paper Adversarial Robustness via Adversarial Label-Smoothing☆11Updated 5 years ago
- ☆20Updated 2 years ago
- Pytorch implementation of "Hallucinating Agnostic Images to Generalize Across Domains"☆11Updated 5 years ago
- Code for the paper "Addressing Model Vulnerability to Distributional Shifts over Image Transformation Sets", ICCV 2019☆27Updated 5 years ago
- A PyTorch implementation for Unsupervised Data Augmentation☆23Updated 2 years ago
- ☆10Updated 5 years ago
- Code for the ICML 2021 paper "Sharing Less is More: Lifelong Learning in Deep Networks with Selective Layer Transfer"☆11Updated 3 years ago
- ☆19Updated 6 years ago
- AdaX: Adaptive Gradient Descent with Exponential Long Term Momery☆34Updated 5 years ago
- [NeurIPS 2021] “Improving Contrastive Learning on Imbalanced Data via Open-World Sampling”, Ziyu Jiang, Tianlong Chen, Ting Chen, Zhangya…☆28Updated 3 years ago
- Official repository for Reliable Label Bootstrapping☆19Updated 2 years ago
- ☆20Updated 4 years ago
- ☆27Updated 2 years ago
- ICLR 2021 (spotlight): Graph Convolution with Low-rank Learnable Local Filters☆15Updated 4 years ago
- Unofficial implement of CLSA(Contrastive Learning with Stronger Augmentations) with minimum modifications on official moco's code☆32Updated 4 years ago
- IJCAI 2021, "Comparing Kullback-Leibler Divergence and Mean Squared Error Loss in Knowledge Distillation"☆41Updated 2 years ago
- Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms☆20Updated 3 years ago
- Distilling knowledge from ensemble of multiple teacher networks to student network with multiple heads☆8Updated 3 years ago
- Code release for paper "A Modulation Module for Multi-task Learning with Applications in Image Retrieval"☆32Updated 6 years ago
- Code for NeurIPS 2019 paper "Screening Sinkhorn Algorithm for Regularized Optimal Transport"☆10Updated 5 years ago
- Graph Knowledge Distillation☆13Updated 5 years ago