brianlan / complex-grad-normLinks
a much more complex case using GradNorm, where the layer sharing situation is sophisticated.
☆15Updated 6 years ago
Alternatives and similar repositories for complex-grad-norm
Users that are interested in complex-grad-norm are comparing it to the libraries listed below
Sorting:
- Implementation "Adapting Auxiliary Losses Using Gradient Similarity" article☆32Updated 6 years ago
- Codes for DATA: Differentiable ArchiTecture Approximation.☆11Updated 3 years ago
- ICLR 2021 (spotlight): Graph Convolution with Low-rank Learnable Local Filters☆15Updated 4 years ago
- Self-Supervised Domain Adaptation with Consistency Training☆19Updated 4 years ago
- Distilling knowledge from ensemble of multiple teacher networks to student network with multiple heads☆8Updated 3 years ago
- IJCAI 2021, "Comparing Kullback-Leibler Divergence and Mean Squared Error Loss in Knowledge Distillation"☆41Updated 2 years ago
- ☆13Updated 3 years ago
- NeurIPS'19: Meta-Weight-Net: Learning an Explicit Mapping For Sample Weighting (Pytorch implementation for class imbalance).☆33Updated 5 years ago
- ☆10Updated 5 years ago
- Pytorch implementation of our paper accepted by IEEE TNNLS, 2022 -- Distilling a Powerful Student Model via Online Knowledge Distillation☆28Updated 3 years ago
- Official repository for Reliable Label Bootstrapping☆19Updated 2 years ago
- ☆20Updated 2 years ago
- Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms☆20Updated 3 years ago
- Code for the paper Adversarial Robustness via Adversarial Label-Smoothing☆11Updated 5 years ago
- MSc group project: Reproduction of 'Multi-Task Learning using Uncertainty to Weigh Losses for Scene Geometry and Semantics'; A. Kendall, …☆89Updated 5 years ago
- Adaptive Activation Network and Functional Regularization for Efficient and Flexible Deep Multi-Task Learning☆17Updated 5 years ago
- ICML'20: SIGUA: Forgetting May Make Learning with Noisy Labels More Robust☆15Updated 4 years ago
- The implementation of paper ''Efficient Attention Network: Accelerate Attention by Searching Where to Plug''.☆20Updated 2 years ago
- A pytorch implementation of DVIB(Deep Variational Information Bottleneck)☆9Updated 6 years ago
- ☆42Updated 5 years ago
- A PyTorch implementation for Unsupervised Data Augmentation☆23Updated 2 years ago
- ☆19Updated 6 years ago
- Implementation of the Heterogeneous Knowledge Distillation using Information Flow Modeling method☆25Updated 5 years ago
- Code for the ICML 2021 paper "Sharing Less is More: Lifelong Learning in Deep Networks with Selective Layer Transfer"☆11Updated 3 years ago
- Interpolation between Residual and Non-Residual Networks, ICML 2020. https://arxiv.org/abs/2006.05749☆26Updated 4 years ago
- [NeurIPS 2021] “Improving Contrastive Learning on Imbalanced Data via Open-World Sampling”, Ziyu Jiang, Tianlong Chen, Ting Chen, Zhangya…☆28Updated 3 years ago
- Pytorch implementation of "Hallucinating Agnostic Images to Generalize Across Domains"☆11Updated 5 years ago
- Code release for NeurIPS 2020 paper "Stochastic Normalization"☆23Updated 3 years ago
- ☆35Updated 3 years ago
- ☆18Updated 3 years ago