uakarsh / latr

Implementation of LaTr: Layout-aware transformer for scene-text VQA, a novel multimodal architecture for Scene Text Visual Question Answering (STVQA)
52 · Updated 3 months ago

Alternatives and similar repositories for latr:

Users who are interested in latr are comparing it to the libraries listed below.