gbup-group / EAN-efficient-attention-network
The implementation of the paper "Efficient Attention Network: Accelerate Attention by Searching Where to Plug".
☆20 · Updated last year
Alternatives and similar repositories for EAN-efficient-attention-network
Users who are interested in EAN-efficient-attention-network are comparing it to the repositories listed below:
- We investigated corruption robustness across different architectures including Convolutional Neural Networks, Vision Transformers, and th… ☆16 · Updated 3 years ago
- The official implementation of paper "Drop-Activation: Implicit Parameter Reduction and Harmonious Regularization". ☆10 · Updated 5 years ago
- Official Pytorch implementation of the paper: "Locally Shifted Attention With Early Global Integration" ☆15 · Updated 3 years ago
- Official implementation for "Minimax Active Learning" in PyTorch. ☆9 · Updated 4 years ago
- The project is about predicting sets (of classes) from images. ☆22 · Updated 3 years ago
- Official code for NeurIPS paper "Combinatorial Optimization for Panoptic Segmentation: A Fully Differentiable Approach". ☆16 · Updated 2 years ago
- Code for our paper: "Regularity Normalization: Neuroscience-Inspired Unsupervised Attention across Neural Network Layers". ☆21 · Updated 3 years ago
- ☆23 · Updated 4 years ago
- A pytorch implementation of Information Bottleneck GAN ☆28 · Updated 6 years ago
- Implementation for NATv2. ☆23 · Updated 4 years ago
- Self-Distillation with weighted ground-truth targets; ResNet and Kernel Ridge Regression ☆18 · Updated 3 years ago
- Implementation of the Remixer Block from the Remixer paper, in Pytorch ☆35 · Updated 3 years ago
- Self-Supervised Domain Adaptation with Consistency Training ☆19 · Updated 4 years ago
- Bag of MLP ☆20 · Updated 3 years ago
- Train SN-GAN with AdaBelief ☆11 · Updated 3 years ago
- Implementation of Kronecker Attention in Pytorch ☆18 · Updated 4 years ago
- Role-Wise Data Augmentation for Knowledge Distillation ☆18 · Updated 2 years ago
- Interpolation between Residual and Non-Residual Networks, ICML 2020. https://arxiv.org/abs/2006.05749 ☆26 · Updated 4 years ago
- Code for the ICML 2021 paper "Sharing Less is More: Lifelong Learning in Deep Networks with Selective Layer Transfer" ☆11 · Updated 3 years ago
- Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms ☆20 · Updated 3 years ago
- Implementation for <Regularizing Neural Networks via Minimizing Hyperspherical Energy> in CVPR'20. ☆24 · Updated 4 years ago
- Code for "Bridging the Gap between f-GANs and Wasserstein GANs", ICML 2020 ☆14 · Updated 4 years ago
- PyTorch implementation of PatchAutoAugment ☆23 · Updated 3 years ago
- Code for DATA: Differentiable ArchiTecture Approximation. ☆11 · Updated 3 years ago
- Implements various optimizers from scratch for analysis and comparison ☆9 · Updated 5 years ago
- ☆12 · Updated 5 years ago
- Bootstrap Your Own Latent (BYOL) pytorch implementation using DistributedDataParallel. ☆28 · Updated 2 years ago
- The Shape of Data: Intrinsic Distance for Comparing Data Distributions ☆12 · Updated 5 years ago
- Optimization and Generalization Analysis of Transduction through Gradient Boosting and Application to Multi-scale Graph Neural Networks ☆13 · Updated 4 years ago
- An adaptive training algorithm for residual network ☆15 · Updated 4 years ago