antonyvigouret / Pay-Attention-to-MLPsLinks
My implementation of the gMLP model from the paper "Pay Attention to MLPs".
☆25Updated 4 years ago
Alternatives and similar repositories for Pay-Attention-to-MLPs
Users that are interested in Pay-Attention-to-MLPs are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of Pay Attention to MLPs☆40Updated 4 years ago
- AdaTask: A Task-Aware Adaptive Learning Rate Approach to Multi-Task Learning. AAAI, 2023.☆28Updated last year
- C-Mixup for NeurIPS 2022☆70Updated last year
- ICLR 2021 (spotlight): Graph Convolution with Low-rank Learnable Local Filters☆15Updated 4 years ago
- Implementation for our WACV 2021 paper "Multi-Loss Weighting with Coefficient of Variations"☆50Updated 4 years ago
- Reproducing the Linear Multihead Attention introduced in Linformer paper (Linformer: Self-Attention with Linear Complexity)☆76Updated 5 years ago
- PyTorch implementation of Representation Learning with Contrastive Predictive Coding by Van den Oord et al. (2018)☆85Updated 3 years ago
- add a Arg: label_smoothing for torch.nn.CrossEntropyLoss()☆14Updated 4 years ago
- Source code for "Distilling Knowledge From Graph Convolutional Networks", CVPR'20☆57Updated 2 years ago
- PyTorch implementation of RealFormer: Transformer Likes Residual Attention☆11Updated 4 years ago
- ☆35Updated 2 years ago
- ☆50Updated 2 years ago
- MODALS: Modality-agnostic Automated Data Augmentation in the Latent Space☆41Updated 4 years ago
- PyTorch Implementation of the Multi-gate Mixture-of-Experts with Exclusivity (MMoEEx)☆32Updated 3 years ago
- ☆18Updated 3 years ago
- Implementation of the paper NAST: Non-Autoregressive Spatial-Temporal Transformer for Time Series Forecasting.☆76Updated 4 years ago
- Learning to Encode Position for Transformer with Continuous Dynamical Model☆60Updated 4 years ago
- pytorch implementation of manifold-mixup☆22Updated 2 years ago
- MetaBalance algorithm for multi-task learning☆58Updated 3 years ago
- Implementation of dynamic temporal pooling (DTP) for time series classification☆39Updated 3 years ago
- Recent Advances in MLP-based Models (MLP is all you need!)☆115Updated 2 years ago
- A study of distance measures and learning methods for semi-supervised learning on time series data☆17Updated 4 years ago
- ☆22Updated 3 years ago
- ☆51Updated 3 years ago
- This repository holds the code for the paper "Deep Conditional Gaussian Mixture Model forConstrained Clustering".☆34Updated 3 years ago
- Uncertainty Aware Semi-Supervised Learning on Graph Data☆40Updated 4 years ago
- Official repository of "Pareto Manifold Learning: Tackling multiple tasks via ensembles of single-task models" [ICML 2023]☆18Updated 5 months ago
- ☆16Updated last year
- The released code for the paper: ''Designing the Topology of Graph Neural Networks: A Novel Feature Fusion Perspective" in WebConf 2022☆24Updated 3 years ago
- Unsupervised Deep Embedding for Clustering Analysis (DEC)☆26Updated 4 years ago