brianlan / complex-grad-norm
a much more complex case using GradNorm, where the layer sharing situation is sophisticated.
☆15Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for complex-grad-norm
- ☆19Updated last year
- NeurIPS'19: Meta-Weight-Net: Learning an Explicit Mapping For Sample Weighting (Pytorch implementation for class imbalance).☆34Updated 5 years ago
- Distilling knowledge from ensemble of multiple teacher networks to student network with multiple heads☆7Updated 3 years ago
- Adaptive Activation Network and Functional Regularization for Efficient and Flexible Deep Multi-Task Learning☆17Updated 4 years ago
- Codes for DATA: Differentiable ArchiTecture Approximation.☆11Updated 3 years ago
- WeightNet: Revisiting the Design Space of Weight Networks☆18Updated 3 years ago
- IJCAI 2021, "Comparing Kullback-Leibler Divergence and Mean Squared Error Loss in Knowledge Distillation"☆39Updated last year
- ☆11Updated 7 months ago
- ICML'20: SIGUA: Forgetting May Make Learning with Noisy Labels More Robust☆13Updated 3 years ago
- ☆19Updated 5 years ago
- Pytorch implementation of "Hallucinating Agnostic Images to Generalize Across Domains"☆11Updated 5 years ago
- Implementation "Adapting Auxiliary Losses Using Gradient Similarity" article☆32Updated 5 years ago
- Paper and Code for "Curriculum Learning by Optimizing Learning Dynamics" (AISTATS 2021)☆19Updated 3 years ago
- ☆13Updated 3 years ago
- ☆13Updated 3 years ago
- Interpolation between Residual and Non-Residual Networks, ICML 2020. https://arxiv.org/abs/2006.05749☆26Updated 4 years ago
- ☆23Updated 4 years ago
- ☆10Updated 4 years ago
- Breaking the Curse of Space Explosion: Towards Efficient NAS with Curriculum Search☆17Updated 3 months ago
- Code repositoy for "AOWS: Adaptive and optimal network width search with latency constraints", CVPR 2020☆35Updated 4 years ago
- Self-Supervised Domain Adaptation with Consistency Training☆19Updated 4 years ago
- Pytorch implementation of our paper accepted by IEEE TNNLS, 2022 -- Distilling a Powerful Student Model via Online Knowledge Distillation☆28Updated 2 years ago
- A PyTorch implementation for Unsupervised Data Augmentation☆23Updated 2 years ago
- ICLR 2021 (spotlight): Graph Convolution with Low-rank Learnable Local Filters☆15Updated 3 years ago
- Code for the ICML 2021 paper "Sharing Less is More: Lifelong Learning in Deep Networks with Selective Layer Transfer"☆11Updated 3 years ago
- Code for the paper Adversarial Robustness via Adversarial Label-Smoothing☆12Updated 4 years ago
- Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms☆20Updated 2 years ago
- growing interpretable part graphs on convnets via multi-shot learning, in AAAI 2017☆16Updated 7 years ago
- Official Implementation of Convolutional Normalization: Improving Robustness and Training for Deep Neural Networks☆30Updated 2 years ago