lucidrains / hamburger-pytorch
PyTorch implementation of the hamburger module from the ICLR 2021 paper "Is Attention Better Than Matrix Decomposition?"
☆99Updated 4 years ago
Alternatives and similar repositories for hamburger-pytorch
Users interested in hamburger-pytorch are comparing it to the libraries listed below.
- Pytorch implementation of Learning Rate Dropout.☆42Updated 5 years ago
- Unofficial PyTorch implementation of "Filter Response Normalization Layer: Eliminating Batch Dependence in the Training of Deep Neural Ne…☆22Updated 5 years ago
- Implementation of OmniNet, Omnidirectional Representations from Transformers, in Pytorch☆58Updated 4 years ago
- Unofficial PyTorch Implementation of EvoNorm☆122Updated 3 years ago
- Unofficial implementation of Stand-Alone Self-Attention in Vision Models (obsolete)☆44Updated 6 years ago
- ☆47Updated 4 years ago
- (Batched) advanced indexing for PyTorch.☆53Updated 6 months ago
- [NeurIPS'20] GradAug: A New Regularization Method for Deep Neural Networks☆94Updated 4 years ago
- ☆49Updated 5 years ago
- PyTorch implementation of Lambda Network and pretrained Lambda-ResNet☆54Updated 4 years ago
- (CVPR 2020) This repo contains code for "PADS: Policy-Adapted Sampling for Visual Similarity Learning", which proposes learnable triplet …☆61Updated 5 years ago
- "Learning Rate Dropout" in PyTorch☆34Updated 5 years ago
- ☆93Updated 4 years ago
- Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch☆119Updated 3 years ago
- ☆54Updated 3 years ago
- Implementation of Kronecker Attention in Pytorch☆19Updated 4 years ago
- Framework for creating (partially) reversible neural networks with PyTorch☆152Updated 2 years ago
- Filter Response Normalization tested on better ImageNet baselines.☆35Updated 5 years ago
- Implementation of the reversible residual network in pytorch☆105Updated 3 years ago
- Full implementation of the paper "Rethinking Softmax with Cross-Entropy: Neural Network Classifier as Mutual Information Estimator".☆102Updated 5 years ago
- Code for "Are labels necessary for neural architecture search"☆92Updated last year
- [ICML 2020] code for "PowerNorm: Rethinking Batch Normalization in Transformers" https://arxiv.org/abs/2003.07845☆120Updated 4 years ago
- Implementation of various Vision Transformers I found interesting☆84Updated 4 years ago
- Implementation of DropLoss for Long-Tail Instance Segmentation in Pytorch☆42Updated 4 years ago
- ContextLab: A Toolbox for Context Feature Augmentation developed with PyTorch☆39Updated 5 years ago
- Code for reproducing experiments in "How Useful is Self-Supervised Pretraining for Visual Tasks?"☆60Updated 11 months ago
- MoEx (Moment Exchange)☆141Updated 4 years ago
- A Pytorch implementation of "LegoNet: Efficient Convolutional Neural Networks with Lego Filters" (ICML 2019).☆140Updated 4 years ago
- The implementation of "Shape Adaptor: A Learnable Resizing Module" [ECCV 2020].☆73Updated 4 years ago
- PyTorch Examples repo for "ReZero is All You Need: Fast Convergence at Large Depth"☆61Updated 11 months ago