brianlan / complex-grad-norm
a much more complex case using GradNorm, where the layer sharing situation is sophisticated.
☆15Updated 6 years ago
Alternatives and similar repositories for complex-grad-norm
Users that are interested in complex-grad-norm are comparing it to the libraries listed below
Sorting:
- ☆23Updated 4 years ago
- Implementation "Adapting Auxiliary Losses Using Gradient Similarity" article☆32Updated 6 years ago
- Codes for DATA: Differentiable ArchiTecture Approximation.☆11Updated 3 years ago
- ☆10Updated 5 years ago
- Adaptive Activation Network and Functional Regularization for Efficient and Flexible Deep Multi-Task Learning☆17Updated 5 years ago
- Pytorch implementation of "Hallucinating Agnostic Images to Generalize Across Domains"☆11Updated 5 years ago
- ICML'20: SIGUA: Forgetting May Make Learning with Noisy Labels More Robust☆15Updated 4 years ago
- NeurIPS'19: Meta-Weight-Net: Learning an Explicit Mapping For Sample Weighting (Pytorch implementation for class imbalance).☆33Updated 5 years ago
- Pytorch implementation of our paper accepted by IEEE TNNLS, 2022 -- Distilling a Powerful Student Model via Online Knowledge Distillation☆28Updated 3 years ago
- Code release for NeurIPS 2020 paper "Stochastic Normalization"☆23Updated 3 years ago
- ☆19Updated 6 years ago
- Official repository for Reliable Label Bootstrapping☆19Updated 2 years ago
- Code for the ICML 2021 paper "Sharing Less is More: Lifelong Learning in Deep Networks with Selective Layer Transfer"☆11Updated 3 years ago
- Ranking-based-Instance-Selection☆32Updated 3 years ago
- ☆13Updated 3 years ago
- The project is about predicting sets (of classes) from images.☆22Updated 3 years ago
- ☆20Updated 2 years ago
- Implementation of Mogrifier LSTM in PyTorch☆35Updated 5 years ago
- WeightNet: Revisiting the Design Space of Weight Networks☆19Updated 4 years ago
- Supercharging Imbalanced Data Learning WithCausal Representation Transfer☆12Updated 3 years ago
- Code release for paper "A Modulation Module for Multi-task Learning with Applications in Image Retrieval"☆32Updated 6 years ago
- A library for multi-task learning and meta-learning.☆11Updated 3 years ago
- AdaTask: A Task-Aware Adaptive Learning Rate Approach to Multi-Task Learning. AAAI, 2023.☆28Updated last year
- Code for the paper "Addressing Model Vulnerability to Distributional Shifts over Image Transformation Sets", ICCV 2019☆27Updated 5 years ago
- Distilling knowledge from ensemble of multiple teacher networks to student network with multiple heads☆8Updated 3 years ago
- ☆37Updated 2 years ago
- Code for Active Mixup in 2020 CVPR☆22Updated 3 years ago
- Interpolation between Residual and Non-Residual Networks, ICML 2020. https://arxiv.org/abs/2006.05749☆26Updated 4 years ago
- Code for Paper "Evidential Softmax for Sparse MultimodalDistributions in Deep Generative Models"☆11Updated 3 years ago
- [ICLR 2021] Heteroskedastic and Imbalanced Deep Learning with Adaptive Regularization☆41Updated 4 years ago