datnnt1997 / multi-head_self-attention
A Faster PyTorch Implementation of Multi-Head Self-Attention
☆75 · Updated 3 years ago
Alternatives and similar repositories for multi-head_self-attention
Users that are interested in multi-head_self-attention are comparing it to the libraries listed below
- LSTM and GRU in PyTorch ☆272 · Updated 6 years ago
- Multi-head attention in PyTorch ☆156 · Updated 6 years ago
- This is a repository for multi-task learning with toy data in PyTorch and TensorFlow ☆137 · Updated 7 years ago
- My code for learning the attention mechanism ☆51 · Updated 5 years ago
- Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms ☆261 · Updated 4 years ago
- PyTorch implementation of some attentions for Deep Learning Researchers. ☆547 · Updated 3 years ago
- PyTorch tutorials on sentiment analysis classification (RNN, LSTM, Bi-LSTM, LSTM+Attention, CNN) ☆329 · Updated 2 years ago
- Implementation of Transformer encoder in PyTorch ☆70 · Updated 5 years ago
- Pseudo Labeling for Neural Networks and Logistic Regression/SVMs (Based on "Pseudo-Label: The Simple and Efficient Semi-Supervised Lear… ☆75 · Updated 6 years ago
- PyTorch implementation of sparse autoencoder. ☆40 · Updated 6 years ago
- Independent implementation of Supervised Contrastive Loss. Straight to the point and beyond ☆85 · Updated 5 years ago
- A PyTorch implementation of a denoising autoencoder. ☆47 · Updated 6 years ago
- Simple PyTorch implementation of focal loss ☆86 · Updated 2 years ago
- ☆156 · Updated 2 years ago
- Multi-task learning using uncertainty to weigh losses for scene geometry and semantics, Auxiliary Tasks in Multi-task Learning ☆642 · Updated 5 years ago
- PyTorch implementations of several SOTA backbone deep neural networks (such as ResNet, ResNeXt, RegNet) on one-dimensional (1D) signal/ti… ☆523 · Updated 3 years ago
- Experiments with supervised contrastive learning methods with different loss functions ☆223 · Updated 3 years ago
- A PyTorch implementation of the TCAN model in "Temporal Convolutional Attention-based Network For Sequence Modeling". ☆142 · Updated 2 years ago
- A Variational Autoencoder based on the ResNet18 architecture ☆121 · Updated 6 years ago
- Custom loss functions to use in (mainly) PyTorch. ☆39 · Updated 5 years ago
- This code reproduces the results presented in the paper "DAEMA: Denoising Autoencoder with Mask Attention" accepted at the ICANN 2021 con… ☆14 · Updated 4 years ago
- PyTorch implementation of "DeepSMOTE: Fusing Deep Learning and SMOTE for Imbalanced Data". ☆121 · Updated 4 years ago
- My implementation of the gMLP model from the paper "Pay Attention to MLPs". ☆25 · Updated 4 years ago
- Official TensorFlow code for the paper "Efficient-CapsNet: Capsule Network with Self-Attention Routing". ☆273 · Updated last week
- PyTorch implementation of GradNorm ☆117 · Updated last year
- TensorFlow implementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale) ☆224 · Updated 5 years ago
- My implementation of the transformer architecture from the "Attention Is All You Need" paper applied to time series. ☆322 · Updated 4 years ago
- Write simple models in PyTorch, such as LSTM/GRU/BiLSTM ☆38 · Updated 3 years ago
- The aim of this repository is to create RBMs, EBMs and DBNs in a generalized manner, so as to allow modification and variation in model typ… ☆66 · Updated last year
- PyTorch implementation of Representation Learning with Contrastive Predictive Coding by Van den Oord et al. (2018) ☆89 · Updated 4 years ago