NydiaAI / g-mlp-tensorflowLinks
A gMLP (gated MLP) implementation in Tensorflow 1.x, as described in the paper "Pay Attention to MLPs" (2105.08050).
☆16Updated 4 years ago
Alternatives and similar repositories for g-mlp-tensorflow
Users that are interested in g-mlp-tensorflow are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of Pay Attention to MLPs☆40Updated 4 years ago
- Transformers are Graph Neural Networks!☆54Updated 4 years ago
- DECAF: Deep Extreme Classification with Label Features☆54Updated 3 years ago
- A PyTorch implementation of the paper - "Synthesizer: Rethinking Self-Attention in Transformer Models"☆73Updated 2 years ago
- Pytorch implementation of "Block Recurrent Transformers" (Hutchins & Schlag et al., 2022)☆85Updated 3 years ago
- Unofficial PyTorch implementation of Fastformer based on paper "Fastformer: Additive Attention Can Be All You Need"."☆133Updated 4 years ago
- Unsupervised Anomaly Detection via Deep Metric Learning with End-to-End Optimization☆12Updated 2 years ago
- Repository for Multimodal AutoML Benchmark☆65Updated 3 years ago
- a simple pytorch implement of Multi-Sample Dropout☆57Updated 6 years ago
- Implementation of Mogrifier LSTM in PyTorch☆34Updated 5 years ago
- [EMNLP'19] Summary for Transformer Understanding☆53Updated 5 years ago
- Code and data to accompany the camera-ready version of "Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Tra…☆32Updated 4 years ago
- A quick walk-through of the innards of LSTMs and a naive implementation of the Mogrifier LSTM paper in PyTorch☆78Updated 5 years ago
- 基于Transformer的单模型、多尺度的VAE模型☆57Updated 4 years ago
- Tensorflow port implementation of Single Headed Attention RNN☆16Updated 5 years ago
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Updated 3 years ago
- A span-based joint named entity recognition (NER) and relation extraction model.☆10Updated 5 years ago
- Implementing SYNTHESIZER: Rethinking Self-Attention in Transformer Models using Pytorch☆70Updated 5 years ago
- Implementation of RealFormer using pytorch☆101Updated 4 years ago
- Unsupervised Data Augmentation experiments in PyTorch☆60Updated 6 years ago
- Implementation of dynamic temporal pooling (DTP) for time series classification☆38Updated 3 years ago
- 🤗An unofficial PyTorch implementation of ConvBert based on huggingface/transformers.☆16Updated 2 years ago
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch☆39Updated 3 years ago
- Repo for "TableParser: Automatic Table Parsing with Weak Supervision from Spreadsheets" at SDU@AAAI-22☆14Updated 2 years ago
- Efficient Neural Interaction Functions Search for Collaborative Filtering☆18Updated 5 years ago
- PyTorch Implementation of the Multi-gate Mixture-of-Experts with Exclusivity (MMoEEx)☆32Updated 4 years ago
- Unofficial PyTorch implementation of the paper "cosFormer: Rethinking Softmax In Attention".☆44Updated 3 years ago
- 简单的挖矿病毒查杀脚本☆18Updated 3 years ago
- Python下shuffle几百G 文件☆33Updated 4 years ago
- A simple implementation of a deep linear Pytorch module☆21Updated 4 years ago