datnnt1997 / multi-head_self-attentionLinks
A Faster Pytorch Implementation of Multi-Head Self-Attention
☆74Updated 3 years ago
Alternatives and similar repositories for multi-head_self-attention
Users that are interested in multi-head_self-attention are comparing it to the libraries listed below
Sorting:
- Multi-head attention in PyTorch☆153Updated 6 years ago
- Simple pytorch implementation of focal loss☆86Updated 2 years ago
- my codes for learning attention mechanism☆50Updated 5 years ago
- Official TensorFlow code for the paper "Efficient-CapsNet: Capsule Network with Self-Attention Routing".☆273Updated 3 years ago
- Pseudo Labeling for Neural Networks and Logistic Regression/SVMs ( Based on "Pseudo-Label : The Simple and Efficient Semi-Supervised Lear…☆74Updated 5 years ago
- Tensorflow implementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)☆222Updated 4 years ago
- Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms☆259Updated 4 years ago
- LSTM and GRU in PyTorch☆264Updated 6 years ago
- This is a repository for Multi-task learning with toy data in Pytorch and Tensorflow☆137Updated 7 years ago
- PyTorch implementation of some attentions for Deep Learning Researchers.☆546Updated 3 years ago
- ☆154Updated 2 years ago
- Experiments with supervised contrastive learning methods with different loss functions☆221Updated 2 years ago
- Transformer implementation in PyTorch.☆491Updated 6 years ago
- Pytorch implementation of Masked Auto-Encoder☆40Updated 3 years ago
- A PyTorch implementation of the TCAN model in "Temporal Convolutional Attention-based Network For Sequence Modeling".☆143Updated 2 years ago
- A PyTorch Tutorials of Sentiment Analysis Classification (RNN, LSTM, Bi-LSTM, LSTM+Attention, CNN)☆326Updated 2 years ago
- Independent implementation of Supervised Contrastive Loss. Straight to the point and beyond☆84Updated 4 years ago
- Multi-task learning using uncertainty to weigh losses for scene geometry and semantics, Auxiliary Tasks in Multi-task Learning☆636Updated 5 years ago
- A Variational Autoencoder based on the ResNet18-architecture☆121Updated 6 years ago
- Generate high quality images for each class even with an imbalanced dataset. An improved version of Balancing GAN.☆38Updated 3 years ago
- Pytorch implementation of "DeepSMOTE: Fusing Deep Learning and SMOTE for Imbalanced Data".☆120Updated 4 years ago
- This is a re-implementation of TransGAN: Two Pure Transformers Can Make One Strong GAN (NeurIPS 2021) in PyTorch.☆115Updated last year
- PyTorch implementation of the GradNorm☆106Updated last year
- A pytorch implementation of the paper Unsupervised Deep Embedding for Clustering Analysis.☆146Updated 6 years ago
- My implementation of the gMLP model from the paper "Pay Attention to MLPs".☆24Updated 4 years ago
- A Pytorch Implementation of a denoising autoencoder.☆46Updated 6 years ago
- About Code release for "Flowformer: Linearizing Transformers with Conservation Flows" (ICML 2022), https://arxiv.org/pdf/2202.06258.pdf☆326Updated last year
- An All-MLP solution for Vision, from Google AI☆1,045Updated 3 months ago
- Implementation of GraphKan with torch geometrics and its application on signal classification☆66Updated last year
- wirte simple models by pytorch,such as lstm/gru/bilstm☆38Updated 3 years ago