guocheng2025 / Transformer-Encoder
Implementation of Transformer encoder in PyTorch
☆70 Updated 5 years ago
Alternatives and similar repositories for Transformer-Encoder
Users who are interested in Transformer-Encoder are comparing it to the libraries listed below.
- Implement the paper "Self-Attention with Relative Position Representations" ☆139 Updated 4 years ago
- A library for making Transformer Variational Autoencoders. (Extends the Huggingface/transformers library.) ☆142 Updated 4 years ago
- Transformer-based Conditional Variational Autoencoder for Controllable Story Generation ☆160 Updated 3 years ago
- PyTorch implementation of some attentions for Deep Learning Researchers. ☆547 Updated 3 years ago
- PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI ☆183 Updated 2 years ago
- Multi-head attention in PyTorch ☆154 Updated 6 years ago
- PyTorch; masked language model; BERT ☆72 Updated 5 years ago
- Independent implementation of Supervised Contrastive Loss. Straight to the point and beyond ☆85 Updated 4 years ago
- PyTorch implementation of the InfoNCE loss for self-supervised learning. ☆600 Updated 2 years ago
- A simple cross attention that updates both the source and target in one step ☆185 Updated 3 months ago
- Experiments with supervised contrastive learning methods with different loss functions ☆222 Updated 2 years ago
- Code for ICML2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information ☆349 Updated last year
- Implementation of H-Transformer-1D, Hierarchical Attention for Sequence Learning ☆165 Updated last year
- Contrastive Predictive Coding for Automatic Speaker Verification ☆504 Updated 6 years ago
- Code for selecting an action based on multimodal inputs; in this case, the inputs are voice and text. ☆73 Updated 4 years ago
- Multimodal Mixture-of-Experts VAE ☆219 Updated 2 years ago
- ☆51 Updated 5 years ago
- A PyTorch & Keras implementation and demo of Fastformer. ☆190 Updated 3 years ago
- An (unofficial) implementation of Focal Loss, as described in the RetinaNet paper, generalized to the multi-class case. ☆238 Updated last year
- Code for "Finetuning Pretrained Transformers into Variational Autoencoders" ☆39 Updated 3 years ago
- ☆212 Updated 3 years ago
- ☆162 Updated 5 months ago
- A minimal pytorch package implementing a gradient reversal layer. ☆158 Updated last year
- This repository summarizes techniques for the KL divergence vanishing problem. ☆30 Updated 6 years ago
- ☆72 Updated 4 years ago
- [ACL'19] [PyTorch] Multimodal Transformer ☆930 Updated 3 years ago
- Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms ☆260 Updated 4 years ago
- ☆259 Updated 4 years ago
- Pytorch reimplementation for "Gradient Surgery for Multi-Task Learning" ☆377 Updated 4 years ago
- Learning Rate Warmup in PyTorch ☆413 Updated 5 months ago