datnnt1997 / multi-head_self-attentionLinks
A Faster Pytorch Implementation of Multi-Head Self-Attention
☆74Updated 3 years ago
Alternatives and similar repositories for multi-head_self-attention
Users that are interested in multi-head_self-attention are comparing it to the libraries listed below
Sorting:
- my codes for learning attention mechanism☆50Updated 5 years ago
- Multi-head attention in PyTorch☆154Updated 6 years ago
- Pseudo Labeling for Neural Networks and Logistic Regression/SVMs ( Based on "Pseudo-Label : The Simple and Efficient Semi-Supervised Lear…☆74Updated 5 years ago
- Simple pytorch implementation of focal loss☆86Updated 2 years ago
- LSTM and GRU in PyTorch☆264Updated 6 years ago
- Independent implementation of Supervised Contrastive Loss. Straight to the point and beyond☆84Updated 4 years ago
- Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms☆259Updated 4 years ago
- This is a repository for Multi-task learning with toy data in Pytorch and Tensorflow☆137Updated 7 years ago
- My implementation of the gMLP model from the paper "Pay Attention to MLPs".☆24Updated 4 years ago
- Tensorflow implementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)☆222Updated 5 years ago
- PyTorch implementation of some attentions for Deep Learning Researchers.☆547Updated 3 years ago
- Official TensorFlow code for the paper "Efficient-CapsNet: Capsule Network with Self-Attention Routing".☆272Updated 3 years ago
- Pytorch implementation of Masked Auto-Encoder☆40Updated 3 years ago
- PyTorch implementation of the GradNorm☆110Updated last year
- [IEEE Access] PyTorch implementation of DCSS: Deep Clustering with Self-supervision using Pairwise Data Similarities☆31Updated 2 years ago
- A PyTorch implementation of the TCAN model in "Temporal Convolutional Attention-based Network For Sequence Modeling".☆143Updated 2 years ago
- ☆364Updated 2 years ago
- This repository holds the code for the paper "Deep Conditional Gaussian Mixture Model forConstrained Clustering".☆34Updated 3 years ago
- Multi-task learning using uncertainty to weigh losses for scene geometry and semantics, Auxiliary Tasks in Multi-task Learning☆638Updated 5 years ago
- Experiments with supervised contrastive learning methods with different loss functions☆221Updated 2 years ago
- DANets (a family of neural networks) for tabular data classification/ regression.☆46Updated 3 years ago
- ☆154Updated 2 years ago
- PyTorch implementation of the InfoNCE loss for self-supervised learning.☆594Updated last year
- ☆72Updated 4 years ago
- PyTorch implementation of Graph Attention Networks☆21Updated 6 years ago
- ☆12Updated 4 years ago
- A pytorch implementation of the paper Unsupervised Deep Embedding for Clustering Analysis.☆146Updated 6 years ago
- Experimenting with different regression losses. Implemented in Pytorch.☆148Updated 6 years ago
- Recurrent neural networks: building a custom LSTM/GRU cell in PyTorch☆28Updated 5 years ago
- Basic implementation of ResNet 50, 101, 152 in PyTorch☆119Updated 3 years ago