datnnt1997 / multi-head_self-attention
A Faster Pytorch Implementation of Multi-Head Self-Attention
☆71 Updated 2 years ago
Related projects
Alternatives and complementary repositories for multi-head_self-attention
- Multi-head attention in PyTorch ☆148 Updated 5 years ago
- My code for learning the attention mechanism ☆50 Updated 4 years ago
- An educational step-by-step implementation of SimCLR that accompanies the blog post ☆32 Updated 2 years ago
- Recurrent neural networks: building a custom LSTM/GRU cell in PyTorch ☆28 Updated 4 years ago
- Simple pytorch implementation of focal loss ☆83 Updated last year
- Experiments with supervised contrastive learning methods with different loss functions ☆216 Updated last year
- Independent implementation of Supervised Contrastive Loss. Straight to the point and beyond ☆76 Updated 3 years ago
- Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification ☆189 Updated 3 years ago
- Basic implementation of ResNet 50, 101, 152 in PyTorch ☆94 Updated 2 years ago
- Implementation of Transformer encoder in PyTorch ☆60 Updated 4 years ago
- Implementation of Swin Transformer with Pytorch ☆107 Updated 3 years ago
- Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms ☆251 Updated 3 years ago
- Tensorflow implementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale) ☆210 Updated 4 years ago
- This is a re-implementation of TransGAN: Two Pure Transformers Can Make One Strong GAN (NeurIPS 2021) in PyTorch. ☆109 Updated 8 months ago
- Pytorch implementation of Masked Auto-Encoder ☆38 Updated 2 years ago
- Multi-task learning using uncertainty to weigh losses for scene geometry and semantics, Auxiliary Tasks in Multi-task Learning ☆584 Updated 4 years ago
- LSTM and GRU in PyTorch ☆251 Updated 5 years ago
- Implementation of the Graph Convolutional Networks in Pytorch ☆30 Updated 3 years ago
- Pseudo Labeling for Neural Networks and Logistic Regression/SVMs ( Based on "Pseudo-Label : The Simple and Efficient Semi-Supervised Lear… ☆73 Updated 4 years ago
- Learning Rate Warmup in PyTorch ☆392 Updated this week
- My implementation of the gMLP model from the paper "Pay Attention to MLPs". ☆24 Updated 3 years ago
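- Study notes on Pytorch-Geometric, covering the basics of the official documentation and the usage of some APIs, as well as example code from the official source and parts of the Pytorch-Geometric source implementation ☆22 Updated 3 years ago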
- ☆19 Updated 4 years ago
- Demonstrates knowledge distillation for image-based models in Keras. ☆52 Updated 3 years ago
- Implementation of Transformer model (originally from Attention is All You Need) applied to Time Series. ☆27 Updated 3 years ago
- Implementation of Linformer for Pytorch ☆257 Updated 10 months ago
- ☆144 Updated 2 years ago
- PyTorch implementation of the InfoNCE loss for self-supervised learning. ☆488 Updated last year
- ☆15 Updated 5 years ago
- An application of Self-Attention GANs and DCGAN on mnist dataset. ☆48 Updated 4 years ago