datnnt1997 / multi-head_self-attention
A Faster Pytorch Implementation of Multi-Head Self-Attention
☆71Updated 2 years ago
Related projects: ⓘ
- Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification☆184Updated 3 years ago
- Multi-head attention in PyTorch☆146Updated 5 years ago
- Experiments with supervised contrastive learning methods with different loss functions☆211Updated last year
- Implementation of Transformer encoder in PyTorch☆57Updated 4 years ago
- Implement the paper "Self-Attention with Relative Position Representations"☆122Updated 3 years ago
- Simple pytorch implementation of focal loss☆79Updated last year
- Independent implementation of Supervised Contrastive Loss. Straight to the point and beyond☆74Updated 3 years ago
- Implementation of Swin Transformer with Pytorch☆103Updated 3 years ago
- Pseudo Labeling for Neural Networks and Logistic Regression/SVMs ( Based on "Pseudo-Label : The Simple and Efficient Semi-Supervised Lear…☆73Updated 4 years ago
- Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms☆249Updated 3 years ago
- Tensorflow implementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)☆206Updated 3 years ago
- PyTorch implementation of some learning rate schedulers for deep learning researcher.☆86Updated 2 years ago
- my codes for learning attention mechanism☆50Updated 4 years ago
- Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision☆207Updated 3 years ago
- A summarization of Transformer-based architectures for CV tasks, including image classification, object detection, segmentation, and Few-…☆104Updated 2 years ago
- A pytorch &keras implementation and demo of Fastformer.☆184Updated last year
- Official TensorFlow code for the paper "Efficient-CapsNet: Capsule Network with Self-Attention Routing".☆263Updated 2 years ago
- This is a repository for Multi-task learning with toy data in Pytorch and Tensorflow☆134Updated 5 years ago
- Squeeze and Excitation network implementation.☆17Updated 5 years ago
- Sequencer: Deep LSTM for Image Classification☆138Updated last year
- Implementation of Linformer for Pytorch☆244Updated 8 months ago
- Implementation of Vision Transformer from scratch and performance compared to standard CNNs (ResNets) and pre-trained ViT on CIFAR10 and …☆100Updated 5 months ago
- A simple cross attention that updates both the source and target in one step☆140Updated 4 months ago
- Example of PyTorch DistributedDataParallel☆59Updated 3 years ago
- Code repository of the paper "Modelling Long Range Dependencies in ND: From Task-Specific to a General Purpose CNN" https://arxiv.org/abs…☆179Updated last year
- Library - Vanilla, ViT, DeiT, BERT, GPT☆62Updated 2 years ago
- ☆51Updated 4 years ago
- Learning and Building Convolutional Neural Networks using PyTorch☆198Updated 2 years ago
- PyTorch implementation of sparse autoencoder.☆29Updated 4 years ago
- Implementation of Visual Transformer for Small-size Datasets☆116Updated 2 years ago