naver-ai / scob
Official Implementation of SCOB [ICCV 2023]
☆22 Updated last year
Alternatives and similar repositories for scob
Users who are interested in scob are comparing it to the repositories listed below.
- Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023 ☆105 Updated last year
- ☆41 Updated last year
- Official PyTorch implementation for Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features … ☆69 Updated 2 years ago
- ☆79 Updated last year
- Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023 ☆46 Updated last year
- Contrast-guided Feature Adjustment Module for Visual Information Extraction ☆29 Updated 2 years ago
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions ☆44 Updated last year
- ☆24 Updated last year
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR) ☆83 Updated last year
- Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer ☆60 Updated 2 years ago
- ☆30 Updated last year
- Official Tensorflow Implementation of SATRN (CVPR Workshop WTDDLE 2020) ☆159 Updated 4 years ago
- Code for "Primitive Representation Learning for Scene Text Recognition" (CVPR 2021) ☆83 Updated 3 years ago
- This dataset contains re-annotations of 4 popular Latin/English scene text recognition datasets. ☆50 Updated 5 years ago
- Official Implementation of TFLOP: Table Structure Recognition Framework with Layout Pointer Mechanism ☆29 Updated 3 weeks ago
- [ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral) ☆42 Updated last year
- Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22) ☆101 Updated 11 months ago
- An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Informat… ☆53 Updated last year
- Code for ICCV 2023 Paper: “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction” ☆53 Updated last year
- ☆80 Updated 2 years ago
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022 ☆13 Updated 2 years ago
- Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR ☆80 Updated 2 years ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition ☆29 Updated 2 years ago
- A PyTorch implementation of "From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network" (ICCV2021) ☆107 Updated 3 years ago
- ☆38 Updated last year
- [NeurIPS2021] BOVText: A Large-Scale, Multidimensional Multilingual Dataset for Video Text Spotting ☆68 Updated last year
- The dataset used in the CVPR 2022 paper (SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Norm… ☆33 Updated 3 years ago
- ☆18 Updated 2 years ago
- swin-transformer custom for OCR ☆114 Updated last year
- ☆29 Updated 2 years ago