ShramanPramanick / VoLTA

Code release for "VoLTA: Vision-Language Transformer with Weakly-Supervised Local-Feature Alignment" [TMLR, 2023]
13Updated 9 months ago

Related projects: