samakos / Document-AI-
☆12Updated last year
Related projects: ⓘ
- ☆10Updated 2 weeks ago
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆17Updated last year
- ☆13Updated 9 months ago
- arXiv 23 "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs"☆11Updated 7 months ago
- The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”, TMM, 2024.☆27Updated 3 weeks ago
- ☆19Updated 7 months ago
- (CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.☆45Updated 3 months ago
- The official repo for the technical report "Scalable Mask Annotation for Video Text Spotting"☆17Updated last year
- Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`☆16Updated 11 months ago
- Vision-Language Pre-Training for Boosting Scene Text Detectors (CVPR2022)☆11Updated 2 years ago
- [IJCAI2023] An official implement of the paper "Towards Robust Scene Text Image Super-resolution via Explicit Location Enhancement"☆51Updated last year
- ☆10Updated 10 months ago
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆72Updated last year
- [MM2023] An official implement of the paper "One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer"☆13Updated 10 months ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆28Updated last year
- (ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer☆71Updated 5 months ago
- ☆10Updated 2 months ago
- ☆24Updated 10 months ago
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)☆79Updated last year
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Updated last year
- ☆17Updated this week
- [IJCV 2024] TransDETR: End-to-end Video Text Spotting with Transformer☆101Updated 5 months ago
- The official code of Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition (IJCAI2023)☆26Updated last year
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers☆21Updated 2 years ago
- Official Implementation of SCOB [ICCV 2023]☆22Updated 10 months ago
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022☆13Updated last year
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆38Updated 5 months ago
- The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer☆40Updated 3 months ago
- Official PyTorch implementation of "CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text Detection"☆14Updated 5 months ago
- Python and JS tools to generate Printed LaTex formulas and images☆13Updated 10 months ago
- Masked Vision-Language Transformer in Fashion☆32Updated 11 months ago