uakarsh / latr

Implementation of LaTr: Layout-aware transformer for scene-text VQA, a novel multimodal architecture for Scene Text Visual Question Answering (STVQA)
52 · Updated 3 weeks ago

Related projects

Alternatives and complementary repositories for latr