antonyvigouret / Pay-Attention-to-MLPsLinks
My implementation of the gMLP model from the paper "Pay Attention to MLPs".
☆25Updated 4 years ago
Alternatives and similar repositories for Pay-Attention-to-MLPs
Users that are interested in Pay-Attention-to-MLPs are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of Pay Attention to MLPs☆40Updated 3 years ago
- add a Arg: label_smoothing for torch.nn.CrossEntropyLoss()☆14Updated 4 years ago
- Reproducing the Linear Multihead Attention introduced in Linformer paper (Linformer: Self-Attention with Linear Complexity)☆76Updated 4 years ago
- ☆22Updated 3 years ago
- AdaTask: A Task-Aware Adaptive Learning Rate Approach to Multi-Task Learning. AAAI, 2023.☆28Updated last year
- PyTorch Implementation of the Multi-gate Mixture-of-Experts with Exclusivity (MMoEEx)☆32Updated 3 years ago
- Source code for "Distilling Knowledge From Graph Convolutional Networks", CVPR'20☆57Updated 2 years ago
- custom pytorch implementation of MoCo v3☆45Updated 4 years ago
- PyTorch implementation of Representation Learning with Contrastive Predictive Coding by Van den Oord et al. (2018)☆85Updated 3 years ago
- Implementation of the paper NAST: Non-Autoregressive Spatial-Temporal Transformer for Time Series Forecasting.☆76Updated 4 years ago
- Implementation of dynamic temporal pooling (DTP) for time series classification☆40Updated 3 years ago
- ☆50Updated 2 years ago
- PyTorch implementation of RealFormer: Transformer Likes Residual Attention☆11Updated 4 years ago
- Official implementation of Auxiliary Learning by Implicit Differentiation [ICLR 2021]☆84Updated 10 months ago
- Implementation for our WACV 2021 paper "Multi-Loss Weighting with Coefficient of Variations"☆50Updated 4 years ago
- C-Mixup for NeurIPS 2022☆70Updated last year
- pytorch implementation of manifold-mixup☆22Updated 2 years ago
- MODALS: Modality-agnostic Automated Data Augmentation in the Latent Space☆41Updated 4 years ago
- Official repository of "Pareto Manifold Learning: Tackling multiple tasks via ensembles of single-task models" [ICML 2023]☆18Updated 4 months ago
- Recent Advances in MLP-based Models (MLP is all you need!)☆115Updated 2 years ago
- ☆17Updated 2 years ago
- ☆18Updated 3 years ago
- Learning to Encode Position for Transformer with Continuous Dynamical Model☆60Updated 4 years ago
- Learnable Fourier Features for Multi-Dimensional Spatial Positional Encoding☆49Updated 8 months ago
- MPVAE: Multivariate Probit Variational AutoEncoder for Multi-Label Classification☆31Updated 8 months ago
- a much more complex case using GradNorm, where the layer sharing situation is sophisticated.☆15Updated 6 years ago
- MetaBalance algorithm for multi-task learning☆58Updated 3 years ago
- Self Supervised Learning for Time Series Using Similarity Distillation☆10Updated 2 years ago
- Implementation of Mogrifier LSTM in PyTorch☆35Updated 5 years ago
- ☆50Updated 3 years ago