lucidrains / bidirectional-cross-attention
A simple cross attention that updates both the source and target in one step
☆164Updated 10 months ago
Alternatives and similar repositories for bidirectional-cross-attention:
Users that are interested in bidirectional-cross-attention are comparing it to the libraries listed below
- Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification☆197Updated 3 years ago
- Implementation of Zorro, Masked Multimodal Transformer, in Pytorch☆97Updated last year
- [T-PAMI] A curated list of self-supervised multimodal learning resources.☆248Updated 7 months ago
- Transformers w/o Attention, based fully on MLPs☆93Updated 11 months ago
- Learnable Fourier Features for Multi-Dimensional Spatial Positional Encoding☆45Updated 5 months ago
- (ICLR 2023) Official PyTorch implementation of "What Do Self-Supervised Vision Transformers Learn?"☆106Updated last year
- Implementation of Deformable Attention in Pytorch from the paper "Vision Transformer with Deformable Attention"☆330Updated last month
- ☆191Updated 2 years ago
- [NeurIPS 2023, Spotlight] Rank-N-Contrast: Learning Continuous Representations for Regression☆107Updated last year
- An implementation of local windowed attention for language modeling☆429Updated 2 months ago
- Implementation of Linformer for Pytorch☆274Updated last year
- An implementation of the efficient attention module.☆305Updated 4 years ago
- Official Open Source code for "Masked Autoencoders As Spatiotemporal Learners"☆331Updated 4 months ago
- Recent Advances in MLP-based Models (MLP is all you need!)☆114Updated 2 years ago
- Implementation of Transformer in Transformer, pixel level attention paired with patch level attention for image classification, in Pytorc…☆304Updated 3 years ago
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆104Updated 3 months ago
- Multimodal Masked Autoencoders (M3AE): A JAX/Flax Implementation☆102Updated last month
- This is a PyTorch implementation of “Context AutoEncoder for Self-Supervised Representation Learning"☆103Updated last year
- [CVPR'23] Hard Patches Mining for Masked Image Modeling☆90Updated last year
- ☆154Updated 2 years ago
- Sequencer: Deep LSTM for Image Classification☆141Updated 2 years ago
- Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, de…☆99Updated 2 years ago
- Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch☆37Updated 3 years ago
- PyTorch reimplementation of FlexiViT: One Model for All Patch Sizes☆59Updated 10 months ago
- [NeurIPS 2021] Official codes for "Efficient Training of Visual Transformers with Small Datasets".☆141Updated 2 months ago
- iFormer: Inception Transformer☆244Updated 2 years ago
- PyTorch implementations of KMeans, Soft-KMeans and Constrained-KMeans which can be run on GPU and work on (mini-)batches of data.☆63Updated 2 years ago
- MultiMAE: Multi-modal Multi-task Masked Autoencoders, ECCV 2022☆570Updated 2 years ago
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"☆125Updated last month
- [ICLR 2022] Official implementation of cosformer-attention in cosFormer: Rethinking Softmax in Attention☆188Updated 2 years ago