antonyvigouret / Pay-Attention-to-MLPsLinks
My implementation of the gMLP model from the paper "Pay Attention to MLPs".
☆25Updated 4 years ago
Alternatives and similar repositories for Pay-Attention-to-MLPs
Users that are interested in Pay-Attention-to-MLPs are comparing it to the libraries listed below
Sorting:
- PyTorch Implementation of the Multi-gate Mixture-of-Experts with Exclusivity (MMoEEx)☆32Updated 4 years ago
- Pytorch implementation of the GradNorm. GradNorm addresses the problem of balancing multiple losses for multi-task learning by learning a…☆270Updated 3 years ago
- This in my Demo of Chen et al. "GradNorm: Gradient Normalization for Adaptive Loss Balancing in Deep Multitask Networks" ICML 2018☆179Updated 3 years ago
- [ICML2020] "Self-PU: Self Boosted and Calibrated Positive-Unlabeled Training" by Xuxi Chen, Wuyang Chen, Tianlong Chen, Ye Yuan, Chen Gon…☆69Updated 3 years ago
- Recent Advances in MLP-based Models (MLP is all you need!)☆116Updated 2 years ago
- PyTorch implementation of Representation Learning with Contrastive Predictive Coding by Van den Oord et al. (2018)☆88Updated 3 years ago
- Custom loss functions to use in (mainly) PyTorch.☆39Updated 4 years ago
- Implementation of dynamic temporal pooling (DTP) for time series classification☆38Updated 3 years ago
- Reproducing the Linear Multihead Attention introduced in Linformer paper (Linformer: Self-Attention with Linear Complexity)☆75Updated 5 years ago
- Pseudo Labeling for Neural Networks and Logistic Regression/SVMs ( Based on "Pseudo-Label : The Simple and Efficient Semi-Supervised Lear…☆74Updated 5 years ago
- ☆27Updated 4 years ago
- This is a repository for Multi-task learning with toy data in Pytorch and Tensorflow☆136Updated 7 years ago
- MSc group project: Reproduction of 'Multi-Task Learning using Uncertainty to Weigh Losses for Scene Geometry and Semantics'; A. Kendall, …☆91Updated 5 years ago
- Pytorch implementation of risk estimators for unbiased and non-negative positive-unlabeled learning☆90Updated last year
- AdaShare: Learning What To Share For Efficient Deep Multi-Task Learning☆112Updated 4 years ago
- AdaTask: A Task-Aware Adaptive Learning Rate Approach to Multi-Task Learning. AAAI, 2023.☆28Updated last year
- C-Mixup for NeurIPS 2022☆73Updated last year
- Official implementation of Auxiliary Learning by Implicit Differentiation [ICLR 2021]☆85Updated last year
- [AAAI 2021] Curriculum Labeling: Revisiting Pseudo-Labeling for Semi-Supervised Learning☆139Updated 4 years ago
- [ICML 2022] RankSim: Ranking Similarity Regularization for Deep Imbalanced Regression☆52Updated 2 years ago
- Official PyTorch implementation of "Meta-Calibration: Learning of Model Calibration Using Differentiable Expected Calibration Error"☆36Updated 2 years ago
- Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms☆259Updated 4 years ago
- A pytorch &keras implementation and demo of Fastformer.☆189Updated 3 years ago
- Implementation of the paper NAST: Non-Autoregressive Spatial-Temporal Transformer for Time Series Forecasting.☆77Updated 4 years ago
- PyTorch implementation of Pay Attention to MLPs☆40Updated 4 years ago
- ☆148Updated 3 years ago
- Second-Order Pooling for Graph Neural Networks☆16Updated 5 years ago
- A pytorch implementation of MCDO(Monte-Carlo Dropout methods)☆57Updated 6 years ago
- ICML 21 - Voice2Series: Adversarial Reprogramming Acoustic Models for Time Series Classification☆73Updated last year
- MODALS: Modality-agnostic Automated Data Augmentation in the Latent Space☆41Updated 4 years ago