ZeningLin / ViBERTgrid-PyTorch

An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents. ICDAR, 2021"

☆53

Alternatives and similar repositories for ViBERTgrid-PyTorch

Users who are interested in ViBERTgrid-PyTorch are comparing it to the repositories listed below.

MAEHCM / ICL-D3IE
Code for ICCV 2023 Paper: “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”
☆53 Updated last year
ZZR8066 / GraphDoc
☆44 Updated 3 years ago
adeline-cs / GTR
Scene text recognition
☆107 Updated 3 years ago
amazon-science / glass-text-spotting
Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)
☆102 Updated last year
jfkuang / CFAM
Contrast-guided Feature Adjustment Module for Visual Information Extraction
☆29 Updated 2 years ago
abdoelsayed2016 / TNCR_Dataset
Deep learning, Convolutional neural networks, Image processing, Document processing, Table detection, Page object detection, Table classi…
☆68 Updated last year
L597383845 / row-col-table-recognition
time-series row column classification
☆14 Updated 3 years ago
clovaai / spade
☆80 Updated 2 years ago
furkanbiten / idl_data
OCR Annotations from Amazon Textract for Industry Documents Library
☆102 Updated 2 years ago
wangwen-whu / WTW-Dataset
This is an official implementation for the WTW Dataset in "Parsing Table Structures in the Wild" on table detection and table structure …
☆179 Updated 3 years ago
shengtao96 / CentripetalText
☆29 Updated 2 years ago
wangyuxin87 / VisionLAN
A PyTorch implementation of "From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network" (ICCV2021)
☆106 Updated 3 years ago
clovaai / units
☆79 Updated 2 years ago
amazon-science / semimtr-text-recognition
Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)
☆83 Updated last year
HCIILAB / EPHOIE
☆106 Updated 4 years ago
FangShancheng / ABINet-PP
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting
☆86 Updated 2 years ago
HCIILAB / LAST
Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition
☆28 Updated last year
CyrilSterling / LPV
The official code of Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition (IJCAI2023)
☆26 Updated last year
Wang-Tianwei / Implicit-feature-alignment
Pytorch implementation for "Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter".
☆67 Updated 4 years ago
MaxKinny / TabRecSet
A large scale camera-taken table detection and recognition dataset.
☆136 Updated 2 weeks ago
weijiawu / BOVText-Benchmark
[NeurIPS2021] BOVText: A Large-Scale, Multidimensional Multilingual Dataset for Video Text Spotting
☆68 Updated last year
machine-intelligence-laboratory / DDI-100
Distorted Document Images dataset (DDI-100).
☆139 Updated 2 years ago
bytedance / SPTSv2
The official implementation of SPTS v2: Single-Point Text Spotting
☆135 Updated 2 years ago
NormXU / Layout2Graph
An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"
☆80 Updated last year
weijiawu / TransVTSpotter
A new video text spotting framework with Transformer
☆77 Updated 3 years ago
shannanyinxiang / SPTS
Official implementation of SPTS: Single-Point Text Spotting (ACM MM 2022 Oral)
☆143 Updated 2 years ago
SCUT-DLVCLab / RFUND
[MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…
☆20 Updated 8 months ago
IBM / SynthTabNet
Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files
☆147 Updated 2 months ago
cndplab-founder / ctdar_measurement_tool
Evaluation Tool for the ICDAR 2019 Competition on Table Detection and Recognition
☆42 Updated 3 years ago
namtuanly / WikiTableSet
WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia
☆30 Updated last month