elis2496 / maxup_implementation
☆12 · Updated 4 years ago
Alternatives and similar repositories for maxup_implementation
Users who are interested in maxup_implementation are comparing it to the libraries listed below.
- Bag of MLP ☆20 · Updated 4 years ago
- Code for the ICML 2021 paper "Sharing Less is More: Lifelong Learning in Deep Networks with Selective Layer Transfer" ☆11 · Updated 3 years ago
- ☆27 · Updated 2 years ago
- ☆23 · Updated 4 years ago
- Information Bottleneck Approach to Spatial Attention Learning, IJCAI2021 ☆15 · Updated 4 years ago
- Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms ☆20 · Updated 3 years ago
- Cyclic Differentiable Architecture Search ☆36 · Updated 3 years ago
- Self-Distillation with weighted ground-truth targets; ResNet and Kernel Ridge Regression ☆18 · Updated 3 years ago
- [ICLR 2023] “Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jian… ☆24 · Updated 2 years ago
- A PyTorch implementation of Proxy Anchor Loss based on CVPR 2020 paper "Proxy Anchor Loss for Deep Metric Learning" ☆10 · Updated 4 years ago
- This is an official implementation of our CVPR 2020 paper "Non-Local Neural Networks With Grouped Bilinear Attentional Transforms". ☆12 · Updated 4 years ago
- Code for Active Mixup in 2020 CVPR ☆23 · Updated 3 years ago
- ☆22 · Updated 3 years ago
- ☆16 · Updated last year
- Codebase for the paper "Beyond BatchNorm: Towards a Unified Understanding of Normalization in Deep Learning" ☆17 · Updated 3 years ago
- [WACV 2022] "Sandwich Batch Normalization: A Drop-In Replacement for Feature Distribution Heterogeneity" by Xinyu Gong, Wuyang Chen, Tian… ☆50 · Updated 3 years ago
- Implementation of Spectral Leakage and Rethinking the Kernel Size in CNNs in Pytorch ☆14 · Updated 4 years ago
- Reducing Channel Redundancy in Convolutional Neural Networks by Features Recombining (TIP 2021) ☆18 · Updated 2 years ago
- Implementation for NATv2. ☆23 · Updated 4 years ago
- CoaT: Co-Scale Conv-Attentional Image Transformers ☆16 · Updated 4 years ago
- ☆13 · Updated 5 years ago
- ☆28 · Updated 5 years ago
- ☆25 · Updated 3 years ago
- Role-Wise Data Augmentation for Knowledge Distillation ☆19 · Updated 2 years ago
- Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch ☆53 · Updated 4 years ago
- Evolving Normalization-Activation Layers ☆19 · Updated 5 years ago
- [ICLR 2022]: Fast AdvProp ☆35 · Updated 3 years ago
- ☆19 · Updated 4 years ago
- ☆41 · Updated 4 years ago
- WeightNet: Revisiting the Design Space of Weight Networks ☆19 · Updated 4 years ago