naver-ai / scob
Official Implementation of SCOB [ICCV 2023]
☆22Updated last year
Alternatives and similar repositories for scob:
Users that are interested in scob are comparing it to the libraries listed below
- Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023☆104Updated last year
- ☆37Updated 9 months ago
- Official PyTorch implementation for Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features …☆69Updated last year
- ☆76Updated last year
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆28Updated last year
- Official implementation for Dessurt☆58Updated 2 years ago
- Code for ICCV 2023 Paper : “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”☆51Updated last year
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Updated 2 years ago
- ☆81Updated last year
- Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR☆80Updated 2 years ago
- This dataset contains re-annotations of 4 popular Latin/English scene text recognition datasets.☆50Updated 4 years ago
- swin-transformer custom for OCR☆114Updated last year
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)☆81Updated last year
- Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)☆102Updated 7 months ago
- TRACE: Table Reconstruction Aligned to Corner and Edges (ICDAR 2023)☆22Updated 11 months ago
- A PyTorch implementation of "From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network" (ICCV2021)☆105Updated 3 years ago
- ☆36Updated last year
- ☆23Updated last year
- ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting☆82Updated 2 years ago
- Scene text recognition☆106Updated 2 years ago
- ☆18Updated last year
- Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023☆44Updated 8 months ago
- The official implementation of SPTS v2: Single-Point Text Spotting☆131Updated last year
- Official Tensorflow Implementation of SATRN (CVPR Workshop WTDDLE 2020)☆160Updated 4 years ago
- ☆58Updated 2 years ago
- The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer☆49Updated 8 months ago
- WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia☆27Updated last year
- It's the code for the paper Pushing the Performance Limit of Scene Text Recognizer without Human Annotation, CVPR 2022.☆28Updated 2 years ago
- CTE: Contextualized Table Extraction Dataset☆17Updated last year
- ☆42Updated 2 years ago