phucty / wtabhtml
Tool to parse wiki tables from the HTML dump of Wikipedia
☆11 Updated 3 years ago
Alternatives and similar repositories for wtabhtml
Users who are interested in wtabhtml are comparing it to the libraries listed below.
- WikiTableSet: The largest publicly available image-based table recognition dataset in three languages, built from Wikipedia ☆32 Updated 3 months ago
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022 ☆13 Updated 2 years ago
- An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Informat… ☆53 Updated last year
- Contrast-guided Feature Adjustment Module for Visual Information Extraction ☆30 Updated 2 years ago
- Code for the ICCV 2023 paper "ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction" ☆54 Updated 2 years ago
- Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision) ☆125 Updated last year
- An official implementation of the paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis" ☆80 Updated last year
- CTE: Contextualized Table Extraction Dataset