lucidrains / bidirectional-cross-attentionLinks

A simple cross attention that updates both the source and target in one step

☆182

Alternatives and similar repositories for bidirectional-cross-attention

Users that are interested in bidirectional-cross-attention are comparing it to the libraries listed below

Sorting:

willGuimont / learnable_fourier_positional_encoding
Learnable Fourier Features for Multi-Dimensional Spatial Positional Encoding
☆55Updated last year
lucidrains / zorro-pytorch
Implementation of Zorro, Masked Multimodal Transformer, in Pytorch
☆96Updated 2 years ago
pliang279 / FactorCL
[NeurIPS 2023] Factorized Contrastive Learning: Going Beyond Multi-view Redundancy
☆71Updated last year
arxyzan / data2vec-pytorch
PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI
☆181Updated 2 years ago
rishikksh20 / CrossViT-pytorch
Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
☆205Updated 4 years ago
ys-zong / awesome-self-supervised-multimodal-learning
[T-PAMI] A curated list of self-supervised multimodal learning resources.
☆263Updated last year
kaiwenzha / Rank-N-Contrast
[NeurIPS 2023, Spotlight] Rank-N-Contrast: Learning Continuous Representations for Regression
☆126Updated last year
catalys1 / mae-pytorch
Simple MAE (masked autoencoders) with pytorch and pytorch-lightning.
☆44Updated last year
shlokk / mae-contrastive
Official implementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".
☆36Updated 2 years ago
salesforce / hierarchicalContrastiveLearning
☆161Updated 4 months ago
lucidrains / gradnorm-pytorch
A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in Pytorch
☆110Updated 2 months ago
XzwHan / CARD
Official PyTorch implementation for the paper "CARD: Classification and Regression Diffusion Models"
☆232Updated 2 years ago
david-knigge / ccnn
Code repository of the paper "Modelling Long Range Dependencies in ND: From Task-Specific to a General Purpose CNN" https://arxiv.org/abs…
☆183Updated 5 months ago
naver-ai / cl-vs-mim
(ICLR 2023) Official PyTorch implementation of "What Do Self-Supervised Vision Transformers Learn?"
☆112Updated last year
fawazsammani / awesome-mlp-mixer
Transformers w/o Attention, based fully on MLPs
☆95Updated last year
cmsflash / efficient-attention
An implementation of the efficient attention module.
☆321Updated 4 years ago
lucidrains / linformer
Implementation of Linformer for Pytorch
☆300Updated last year
bwconrad / flexivit
PyTorch reimplementation of FlexiViT: One Model for All Patch Sizes
☆61Updated last year
htdt / hyp_metric
Hyperbolic Vision Transformers: Combining Improvements in Metric Learning | Official repository
☆202Updated 3 years ago
Haochen-Wang409 / HPM
[CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling & Bootstrap Masked Visual Modeling via Hard Patch Mining
☆103Updated 6 months ago
jacklishufan / Mamba-ND
Ofiicial Implementation for Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data
☆64Updated last year
jokofa / torch_kmeans
PyTorch implementations of KMeans, Soft-KMeans and Constrained-KMeans which can be run on GPU and work on (mini-)batches of data.
☆73Updated 2 years ago
young-geng / m3ae_public
Multimodal Masked Autoencoders (M3AE): A JAX/Flax Implementation
☆103Updated 8 months ago
TonyLianLong / CrossMAE
Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders
☆121Updated 6 months ago
EPFL-VILAB / MultiMAE
MultiMAE: Multi-modal Multi-task Masked Autoencoders, ECCV 2022
☆598Updated 2 years ago
kahnchana / opl
Official repository for "Orthogonal Projection Loss" (ICCV'21)
☆125Updated 3 years ago
lucidrains / local-attention
An implementation of local windowed attention for language modeling
☆483Updated 3 months ago
miguelsvasco / gmc
Official Implementation of "Geometric Multimodal Contrastive Representation Learning" (https://arxiv.org/abs/2202.03390)
☆28Updated 9 months ago
facebookresearch / VICRegL
VICRegL official code base
☆231Updated 2 years ago
lucidrains / uniformer-pytorch
Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, de…
☆102Updated 3 years ago