namtuanly / WikiTableSetLinks

WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia

☆30

Alternatives and similar repositories for WikiTableSet

Users that are interested in WikiTableSet are comparing it to the libraries listed below

Sorting:

jfkuang / CFAM
Contrast-guided Feature Adjustment Module for Visual Information Extraction
☆29Updated 2 years ago
HCIILAB / LAST
Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition
☆28Updated last year
JG1VPP / MuTabNet
ICDAR 2024 Table OCR Model
☆36Updated 3 weeks ago
MAEHCM / ICL-D3IE
Code for ICCV 2023 Paper : “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”
☆53Updated last year
onealwj / MVLT
PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition
☆29Updated 2 years ago
wzx99 / CLIPOCR
☆38Updated last year
poloclub / tsr-convstem
High-Performance Transformers for Table Structure Recognition Need Early Convolutions
☆45Updated last year
ZZR8066 / GraphDoc
☆44Updated 3 years ago
Xiaomeng-Yang / STR_benchmark_cleansed
☆14Updated 2 years ago
phucty / wtabhtml
Tool to parse wiki tables from the HTML dump of Wikipedia
☆11Updated 3 years ago
bytedance / SPTSv2
The official implementation of SPTS v2: Single-Point Text Spotting
☆135Updated 2 years ago
SCUT-DLVCLab / GPT-4V_OCR
Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)
☆125Updated last year
ZeningLin / PEneo
[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.
☆36Updated 4 months ago
Pay20Y / PIMNet
☆16Updated 3 years ago
LARS-research / TREFE
Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022
☆13Updated 2 years ago
ZeningLin / ViBERTgrid-PyTorch
An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Informat…
☆53Updated last year
MelosY / CAM
☆25Updated last year
HCIILAB / M5HisDoc
☆30Updated last year
NormXU / Layout2Graph
An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"
☆80Updated last year
zzyhlyoko / DCTC
☆42Updated last year
CyrilSterling / LPV
The official code of Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition (IJCAI2023)
☆26Updated last year
FutureRising007 / Table_Structure_Recognition
Table Structure Recognition
☆76Updated 2 years ago
MaxKinny / TabRecSet
A large scale camera-taken table detection and recognition dataset.
☆136Updated 2 weeks ago
allanj / LayoutLMv3-DocVQA
Example codebase for fine-tuning layoutLMv3 on DocVQA
☆52Updated 2 years ago
cxfyxl / VIPTR
☆41Updated last year
abdoelsayed2016 / TNCR_Dataset
Deep learning, Convolutional neural networks, Image processing, Document processing, Table detection, Page object detection, Table classi…
☆68Updated last year
amazon-science / glass-text-spotting
Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)
☆102Updated last year
NormXU / DocParser-Pytorch
An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents
☆37Updated last year
thanhnghiadk / syntactic_HME_generation
This project aims to generate syntactichandwritten mathematical expression. The dataset is generated from the CROHME 2014 training set.
☆14Updated 3 years ago
IBM / SynthTabNet
Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files
☆147Updated 3 months ago