xiaojino / RUArtView external linksLinks
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering
☆10Nov 27, 2022Updated 3 years ago
Alternatives and similar repositories for RUArt
Users that are interested in RUArt are comparing it to the libraries listed below
Sorting:
- Implementation of LaTr: Layout-aware transformer for scene-text VQA,a novel multimodal architecture for Scene Text Visual Question Answer…☆55Oct 30, 2024Updated last year
- Official code for the paper: "Perception and Semantic Aware Regularization for Sequential Confidence Calibration (CVPR2023)"☆10May 15, 2024Updated last year
- Official code for paper "Spatially Aware Multimodal Transformers for TextVQA" published at ECCV, 2020.☆65Sep 15, 2021Updated 4 years ago
- TAP: Text-Aware Pre-training for Text-VQA and Text-Caption, CVPR 2021 (Oral)☆72May 22, 2023Updated 2 years ago
- ☆188May 8, 2024Updated last year
- The imdb files with SBD-Trans OCR for TextVQA dataset.☆11Nov 30, 2021Updated 4 years ago
- ☆14May 26, 2023Updated 2 years ago
- [ICLR 2023] “ Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jian…☆24Feb 16, 2023Updated 3 years ago
- Scene text rectification using glyph and character alignment properties☆21Jan 21, 2018Updated 8 years ago
- A Better Way to Attend: Attention with Trees for Video Question Answering☆25Mar 25, 2019Updated 6 years ago
- Attention-based sampler in TASN (Trilinear Attention Sampling Network)☆23Jun 8, 2020Updated 5 years ago
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition☆28Aug 29, 2023Updated 2 years ago
- It's the code for the paper Pushing the Performance Limit of Scene Text Recognizer without Human Annotation, CVPR 2022.☆28Jul 6, 2022Updated 3 years ago
- Local self-attention in Transformer for visual question answering☆13Mar 17, 2024Updated last year
- A clone from Max Jaderberg's Text Renderer☆34Jun 16, 2016Updated 9 years ago
- Code for our ACL2021 paper: "Check It Again: Progressive Visual Question Answering via Visual Entailment"☆31Nov 24, 2021Updated 4 years ago
- Code for pre-training CharacterBERT models (as well as BERT models).☆34Sep 6, 2021Updated 4 years ago
- ☆11Aug 15, 2021Updated 4 years ago
- 毕业设计: 基于深度学习的视觉问答☆14Jun 20, 2018Updated 7 years ago
- A simple toolkit for processing event-based data.☆13Dec 13, 2025Updated 2 months ago
- Fine-Grained Knowledge Fusion for Retrieval-Augmented Medical Visual Question☆11Jul 18, 2024Updated last year
- Python implementation of the Snake (active contours) algorithm proposed by KASS, WITKIN and TERZOPOULOS in 1988.☆34Nov 25, 2019Updated 6 years ago
- Directed masked autoencoders☆14Feb 5, 2026Updated last week
- A PyTorch implementation of "From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network" (ICCV2021)☆106Dec 9, 2021Updated 4 years ago
- [ICLR 2023] RC-MAE☆53Dec 18, 2023Updated 2 years ago
- Tutorial demonstrating how to leverage Pytorch and its features to carry out Information Extraction.☆11Dec 1, 2020Updated 5 years ago
- [ACM MM 2020] Exploring Font-independent Features for Scene Text Recognition☆44Nov 30, 2020Updated 5 years ago
- ☆12Mar 7, 2019Updated 6 years ago
- Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation☆19Nov 28, 2022Updated 3 years ago
- HTML5 Application to manipulate a Coons Bicubic Surface in 3D using its corner points, U and W tangents and UW twists.☆10Aug 19, 2019Updated 6 years ago
- ☆12Aug 25, 2017Updated 8 years ago
- ☆10Nov 15, 2021Updated 4 years ago
- ☆13Jul 28, 2024Updated last year
- Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…☆13Jan 30, 2020Updated 6 years ago
- STVQA and TextVQA OCR results from Amazon Text in Image pipeline☆11Jul 18, 2022Updated 3 years ago
- Graph Key Information Extraction: GKIE☆11Sep 15, 2022Updated 3 years ago
- Convert pdf to pages of images☆13Apr 18, 2020Updated 5 years ago
- the code for paper: A Symmetric Dual Encoding Dense Retrieval Framework for Knowledge-Intensive Visual Question Answering☆13Aug 22, 2023Updated 2 years ago
- ☆12Jun 11, 2023Updated 2 years ago