guocheng2025 / Transformer-Encoder
Implementation of Transformer encoder in PyTorch
☆69 Updated 5 years ago
Alternatives and similar repositories for Transformer-Encoder
Users who are interested in Transformer-Encoder compare it to the libraries listed below.
- Implementation of the paper "Self-Attention with Relative Position Representations" ☆139 Updated 4 years ago
- Transformer-based Conditional Variational Autoencoder for Controllable Story Generation ☆159 Updated 3 years ago
- PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI ☆182 Updated 2 years ago
- PyTorch implementation of the InfoNCE loss for self-supervised learning. ☆594 Updated last year
- PyTorch; masked language model; BERT ☆72 Updated 5 years ago
- Multi-head attention in PyTorch ☆154 Updated 6 years ago
- A library for making Transformer Variational Autoencoders. (Extends the Huggingface/transformers library.) ☆142 Updated 4 years ago
- A simple cross attention that updates both the source and target in one step ☆182 Updated 3 months ago
- Independent implementation of Supervised Contrastive Loss. Straight to the point and beyond ☆84 Updated 4 years ago
- PyTorch implementation of some attentions for Deep Learning Researchers. ☆547 Updated 3 years ago
- Code for selecting an action based on multimodal inputs; in this case the inputs are voice and text. ☆73 Updated 4 years ago
- Experiments with supervised contrastive learning methods with different loss functions ☆221 Updated 2 years ago
- Multimodal Mixture-of-Experts VAE ☆215 Updated 2 years ago
- A Faster PyTorch Implementation of Multi-Head Self-Attention ☆74 Updated 3 years ago
- ☆72 Updated 4 years ago
- Implementation of H-Transformer-1D, Hierarchical Attention for Sequence Learning ☆165 Updated last year
- A PyTorch & Keras implementation and demo of Fastformer. ☆190 Updated 3 years ago
- ☆161 Updated 4 months ago
- An (unofficial) implementation of Focal Loss, as described in the RetinaNet paper, generalized to the multi-class case. ☆238 Updated last year
- Code for "Finetuning Pretrained Transformers into Variational Autoencoders" ☆39 Updated 3 years ago
- ☆255 Updated 4 years ago
- PyTorch implementation of some learning rate schedulers for deep learning researchers. ☆91 Updated 3 years ago
- Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms ☆259 Updated 4 years ago
- ☆212 Updated 3 years ago
- Simple PyTorch implementation of focal loss ☆86 Updated 2 years ago
- PyTorch implementation of the models described in the IEEE ICASSP 2022 paper "Is cross-attention preferable to self-attention for multi-m… ☆61 Updated 7 months ago
- This repository summarizes techniques for the KL divergence vanishing problem. ☆30 Updated 6 years ago
- ☆196 Updated 2 years ago
- Code for ICML2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information ☆347 Updated last year
- [NeurIPS 2023] Factorized Contrastive Learning: Going Beyond Multi-view Redundancy ☆73 Updated last year