dandelin / ViLT

Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
1,422Updated 8 months ago

Alternatives and similar repositories for ViLT:

Users that are interested in ViLT are comparing it to the libraries listed below