lucidrains / hamburger-pytorch
Pytorch implementation of the hamburger module from the ICLR 2021 paper "Is Attention Better Than Matrix Decomposition"
☆98Updated 4 years ago
Alternatives and similar repositories for hamburger-pytorch:
Users that are interested in hamburger-pytorch are comparing it to the libraries listed below
- Unofficial PyTorch Implementation of EvoNorm☆121Updated 3 years ago
- ☆49Updated 5 years ago
- (CVPR 2020) This repo contains code for "PADS: Policy-Adapted Sampling for Visual Similarity Learning", which proposes learnable triplet …☆60Updated 4 years ago
- ☆47Updated 4 years ago
- MoEx (Moment Exchange)☆142Updated 3 years ago
- The implementation of "Shape Adaptor: A Learnable Resizing Module" [ECCV 2020].☆73Updated 4 years ago
- Pytorch implementation of Learning Rate Dropout.☆42Updated 5 years ago
- Implementation of OmniNet, Omnidirectional Representations from Transformers, in Pytorch☆57Updated 4 years ago
- Unofficial PyTorch implementation of "Filter Response Normalization Layer: Eliminating Batch Dependence in the Training of Deep Neural Ne…☆22Updated 5 years ago
- Bootstrap Your Own Latent (BYOL) pytorch implementation using DistributedDataParallel.☆28Updated 2 years ago
- Sparse Switchable Normalization with sparse activation function SparestMax☆64Updated 5 years ago
- A Pytorch implementation of "LegoNet: Efficient Convolutional Neural Networks with Lego Filters" (ICML 2019).☆140Updated 4 years ago
- ☆25Updated 4 years ago
- ☆92Updated 4 years ago
- [NeurIPS'20] GradAug: A New Regularization Method for Deep Neural Networks☆93Updated 4 years ago
- ☆62Updated 4 years ago
- "Learning Rate Dropout" in PyTorch☆34Updated 5 years ago
- Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch☆118Updated 3 years ago
- Full implementation of the paper "Rethinking Softmax with Cross-Entropy: Neural Network Classifier as Mutual Information Estimator".☆101Updated 5 years ago
- ☆182Updated 2 years ago
- A ShuffleBatchNorm layer to shuffle BatchNorm statistics across multiple GPUs☆56Updated 3 years ago
- Filter Response Normalization tested on better ImageNet baselines.☆35Updated 4 years ago
- Code for "Are labels necessary for neural architecture search"☆92Updated last year
- Implementation of the 😇 Attention layer from the paper, Scaling Local Self-Attention For Parameter Efficient Visual Backbones☆198Updated 4 years ago
- Implementation of various Vision Transformers I found interesting☆84Updated 3 years ago
- (Batched) advanced indexing for PyTorch.☆53Updated 2 months ago
- [ICML 2020] code for "PowerNorm: Rethinking Batch Normalization in Transformers" https://arxiv.org/abs/2003.07845☆119Updated 3 years ago
- Parametric Instance Classification for Unsupervised Visual Feature Learning, NeurIPS 2020☆52Updated 3 years ago
- Framework for creating (partially) reversible neural networks with PyTorch☆151Updated 2 years ago
- Implementation of the reversible residual network in pytorch☆104Updated 3 years ago