naver-ai / scob
Official Implementation of SCOB [ICCV 2023]
☆22Updated last year
Alternatives and similar repositories for scob:
Users that are interested in scob are comparing it to the libraries listed below
- Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023☆104Updated last year
- ☆37Updated 10 months ago
- Official PyTorch implementation for Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features …☆69Updated last year
- ☆79Updated last year
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆28Updated last year
- Official implementation for Dessurt☆58Updated 2 years ago
- Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)☆122Updated last year
- ☆37Updated last year
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Updated 2 years ago
- Code for ICCV 2023 Paper : “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”☆52Updated last year
- swin-transformer custom for OCR☆114Updated last year
- ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting☆83Updated 2 years ago
- ☆24Updated last year
- TRACE: Table Reconstruction Aligned to Corner and Edges (ICDAR 2023)☆23Updated last year
- [MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.☆28Updated 3 months ago
- ☆23Updated last year
- This dataset contains re-annotations of 4 popular Latin/English scene text recognition datasets.☆50Updated 5 years ago
- Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023☆45Updated 9 months ago
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)☆82Updated last year
- WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia☆28Updated last year
- CTE: Contextualized Table Extraction Dataset☆17Updated 2 years ago
- ☆81Updated last year
- Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR☆80Updated 2 years ago
- [ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)☆39Updated last year
- ☆60Updated 2 years ago
- An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Informat…☆54Updated last year
- Official Implementation of TFLOP: Table Structure Recognition Framework with Layout Pointer Mechanism☆23Updated this week
- ☆158Updated 2 years ago
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers☆21Updated 2 years ago
- The official implementation of SPTS v2: Single-Point Text Spotting☆131Updated last year