datnnt1997 / multi-head_self-attentionLinks
A Faster Pytorch Implementation of Multi-Head Self-Attention
☆74Updated 3 years ago
Alternatives and similar repositories for multi-head_self-attention
Users that are interested in multi-head_self-attention are comparing it to the libraries listed below
Sorting:
- Multi-head attention in PyTorch☆153Updated 6 years ago
- my codes for learning attention mechanism☆50Updated 5 years ago
- Official TensorFlow code for the paper "Efficient-CapsNet: Capsule Network with Self-Attention Routing".☆273Updated 3 years ago
- LSTM and GRU in PyTorch☆262Updated 6 years ago
- Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms☆259Updated 4 years ago
- Pseudo Labeling for Neural Networks and Logistic Regression/SVMs ( Based on "Pseudo-Label : The Simple and Efficient Semi-Supervised Lear…☆73Updated 5 years ago
- A pytorch implementation of Capsule Network.☆98Updated last year
- My implementation of the gMLP model from the paper "Pay Attention to MLPs".☆25Updated 4 years ago
- This is a repository for Multi-task learning with toy data in Pytorch and Tensorflow☆136Updated 6 years ago
- Image classification using Graph Neural Networks (GNNs) with MNIST dataset☆43Updated 2 years ago
- Implementation of Transformer encoder in PyTorch☆67Updated 5 years ago
- Multi-task learning using uncertainty to weigh losses for scene geometry and semantics, Auxiliary Tasks in Multi-task Learning☆633Updated 5 years ago
- wirte simple models by pytorch,such as lstm/gru/bilstm☆38Updated 2 years ago
- Simple pytorch implementation of focal loss☆86Updated 2 years ago
- Tensorflow implementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)☆218Updated 4 years ago
- PyTorch implementation of Representation Learning with Contrastive Predictive Coding by Van den Oord et al. (2018)☆87Updated 3 years ago
- PyTorch implementation of the Graph Convolutional Network by Kipf et al.☆28Updated 4 years ago
- Independent implementation of Supervised Contrastive Loss. Straight to the point and beyond☆84Updated 4 years ago
- Transformer implementation in PyTorch.☆491Updated 6 years ago
- A PyTorch Tutorials of Sentiment Analysis Classification (RNN, LSTM, Bi-LSTM, LSTM+Attention, CNN)☆324Updated 2 years ago
- Implementation of Axial attention - attending to multi-dimensional data efficiently☆385Updated 4 years ago
- Experiments with supervised contrastive learning methods with different loss functions☆221Updated 2 years ago
- An All-MLP solution for Vision, from Google AI☆1,037Updated last month
- Custom loss functions to use in (mainly) PyTorch.☆39Updated 4 years ago
- A PyTorch implementation of "CoAtNet: Marrying Convolution and Attention for All Data Sizes"☆388Updated 3 years ago
- ☆152Updated 2 years ago
- This code reproduces the results presented in the paper "DAEMA: Denoising Autoencoder with Mask Attention" accepted at the ICANN 2021 con…☆14Updated 4 years ago
- A PyTorch implementation of the TCAN model in "Temporal Convolutional Attention-based Network For Sequence Modeling".☆142Updated 2 years ago
- A pytorch implementation of the paper Unsupervised Deep Embedding for Clustering Analysis.☆144Updated 6 years ago
- About Code release for "Flowformer: Linearizing Transformers with Conservation Flows" (ICML 2022), https://arxiv.org/pdf/2202.06258.pdf☆326Updated last year