lucidrains / hamburger-pytorchLinks
Pytorch implementation of the hamburger module from the ICLR 2021 paper "Is Attention Better Than Matrix Decomposition"
☆99Updated 4 years ago
Alternatives and similar repositories for hamburger-pytorch
Users that are interested in hamburger-pytorch are comparing it to the libraries listed below
Sorting:
- Unofficial PyTorch Implementation of EvoNorm☆122Updated 3 years ago
- ☆47Updated 4 years ago
- Unofficial PyTorch implementation of "Filter Response Normalization Layer: Eliminating Batch Dependence in the Training of Deep Neural Ne…☆22Updated 5 years ago
- Implementation of OmniNet, Omnidirectional Representations from Transformers, in Pytorch☆58Updated 4 years ago
- Pytorch implementation of Learning Rate Dropout.☆42Updated 5 years ago
- Unofficial implementation of Stand-Alone Self-Attention in Vision Models (obsolete)☆44Updated 6 years ago
- "Learning Rate Dropout" in PyTorch☆34Updated 5 years ago
- [NeurIPS'20] GradAug: A New Regularization Method for Deep Neural Networks☆94Updated 4 years ago
- Full implementation of the paper "Rethinking Softmax with Cross-Entropy: Neural Network Classifier as Mutual Information Estimator".☆102Updated 5 years ago
- Pytorch implementation of the image transformer for unconditional image generation☆118Updated last year
- (CVPR 2020) This repo contains code for "PADS: Policy-Adapted Sampling for Visual Similarity Learning", which proposes learnable triplet …☆61Updated 5 years ago
- A Pytorch implementation of "LegoNet: Efficient Convolutional Neural Networks with Lego Filters" (ICML 2019).☆140Updated 4 years ago
- Implementation of Kronecker Attention in Pytorch☆19Updated 4 years ago
- Implementation of DropLoss for Long-Tail Instance Segmentation in Pytorch☆42Updated 4 years ago
- [ICML 2020] code for "PowerNorm: Rethinking Batch Normalization in Transformers" https://arxiv.org/abs/2003.07845☆120Updated 4 years ago
- PyTorch implementation of Lambda Network and pretrained Lambda-ResNet☆54Updated 4 years ago
- ☆26Updated 4 years ago
- ☆49Updated 5 years ago
- A Pytorch implementation of Global Self-Attention Network, a fully-attention backbone for vision tasks☆95Updated 4 years ago
- A pytorch implementation of Information Bottleneck GAN☆28Updated 6 years ago
- ☆54Updated 3 years ago
- Improving generalization by controlling label-noise information in neural network weights.☆40Updated 4 years ago
- The implementation of "Shape Adaptor: A Learnable Resizing Module" [ECCV 2020].☆73Updated 4 years ago
- PyTorch Examples repo for "ReZero is All You Need: Fast Convergence at Large Depth"☆61Updated last year
- Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch☆119Updated 4 years ago
- Code for "Are labels necessary for neural architecture search"☆92Updated last year
- (Batched) advanced indexing for PyTorch.☆53Updated 7 months ago
- ☆93Updated 4 years ago
- Implementation for our paper exploring a novel 2D adaptive attention span kernel in computer vision.☆35Updated last year
- Implementation of various Vision Transformers I found interesting☆84Updated 4 years ago