naver-ai / scob
Official Implementation of SCOB [ICCV 2023]
☆22Updated last year
Alternatives and similar repositories for scob:
Users that are interested in scob are comparing it to the libraries listed below
- Official PyTorch implementation for Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features …☆69Updated last year
- Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023☆104Updated last year
- ☆78Updated last year
- ☆37Updated last year
- ☆37Updated 9 months ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Updated 2 years ago
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)☆82Updated last year
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆28Updated last year
- ☆29Updated 8 months ago
- ☆23Updated last year
- ☆58Updated 2 years ago
- This dataset contains re-annotations of 4 popular Latin/English scene text recognition datasets.☆50Updated 4 years ago
- Code for ICCV 2023 Paper : “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”☆52Updated last year
- WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia☆27Updated last year
- Scene text recognition☆106Updated 2 years ago
- [AAAI 2024] SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression☆62Updated 2 weeks ago
- A PyTorch implementation of "From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network" (ICCV2021)☆105Updated 3 years ago
- An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Informat…☆54Updated last year
- It's the code for the paper Pushing the Performance Limit of Scene Text Recognizer without Human Annotation, CVPR 2022.☆28Updated 2 years ago
- Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)☆122Updated last year
- ☆23Updated last year
- The official code of Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition (IJCAI2023)☆27Updated last year
- ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting☆83Updated 2 years ago
- ☆16Updated 3 years ago
- Code for "Primitive Representation Learning for Scene Text Recognition" (CVPR 2021)☆82Updated 2 years ago
- swin-transformer custom for OCR☆114Updated last year
- Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)☆102Updated 8 months ago
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆42Updated 11 months ago
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models☆32Updated last month
- Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR☆80Updated 2 years ago