uakarsh / latrView on GitHub
Implementation of LaTr: Layout-aware transformer for scene-text VQA,a novel multimodal architecture for Scene Text Visual Question Answering (STVQA)
55Oct 30, 2024Updated last year

Alternatives and similar repositories for latr

Users that are interested in latr are comparing it to the libraries listed below

Sorting:

Are these results useful?