InsightsNet / texannotate

TeX compilation service that makes use of arXiv.org's AutoTeX library.

☆34

Alternatives and similar repositories for texannotate

Users who are interested in texannotate are comparing it to the libraries listed below.

IBM / SynthTabNet
Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files
☆147 Updated 3 months ago
SCUT-DLVCLab / Document-AI-Recommendations
Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.
☆198 Updated 5 months ago
MaxKinny / TabRecSet
A large-scale camera-taken table detection and recognition dataset.
☆136 Updated 3 weeks ago
NormXU / Layout2Graph
An official implementation of the paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"
☆80 Updated last year
NormXU / ERNIE-Layout-Pytorch
An unofficial PyTorch implementation of ERNIE-Layout, which was originally released through PaddleNLP.
☆106 Updated last year
HCIILAB / M6Doc
☆143 Updated 3 months ago
DS3Lab / WordScape
The WordScape repository contains code for the WordScape pipeline to create datasets to train document understanding models.
☆37 Updated last year
doc-analysis / ReadingBank
ReadingBank: A Benchmark Dataset for Reading Order Detection
☆107 Updated 11 months ago
SCUT-DLVCLab / GPT-4V_OCR
Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)
☆125 Updated last year
ZeningLin / ViBERTgrid-PyTorch
An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Informat…
☆53 Updated last year
LukeForeverYoung / UReader
☆137 Updated last year
LayTextLLM / LayTextLLM
☆96 Updated 7 months ago
microsoft / ArxivFormula
This repo is used to release the ArxivFormula dataset.
☆31 Updated 8 months ago
Sanster / xy-cut
☆87 Updated 3 years ago
microsoft / CompHRDoc
Datasets and Evaluation Scripts for CompHRDoc
☆48 Updated 5 months ago
allanj / LayoutLMv3-DocVQA
Example codebase for fine-tuning LayoutLMv3 on DocVQA
☆52 Updated 2 years ago
SCUT-DLVCLab / RFUND
[MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…
☆20 Updated 8 months ago
Ucas-HaoranWei / Vary-tiny-600k
Vary-tiny codebase built on LAVIS (for training from scratch) and a dataset of PDF image-text pairs (about 600k, including English and Chinese)
☆85 Updated 10 months ago
namtuanly / WikiTableSet
WikiTableSet: The largest publicly available image-based table recognition dataset in three languages, built from Wikipedia
☆30 Updated last month
FutureRising007 / Table_Structure_Recognition
Table Structure Recognition
☆76 Updated 2 years ago
MAEHCM / ICL-D3IE
Code for the ICCV 2023 paper "ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction"
☆53 Updated 2 years ago
ZeningLin / PEneo
[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.
☆36 Updated 4 months ago
andreagemelli / doc2graph
Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks.
☆130 Updated 2 years ago
rossumai / docile
DocILE: Document Information Localization and Extraction Benchmark
☆133 Updated last year
guoxy25 / Ocean-OCR
☆37 Updated 6 months ago
Yuxiang1995 / ICDAR2021_MFD
1st-place solution for the ICDAR 2021 Competition on Mathematical Formula Detection (champion solution for formula detection)
☆132 Updated last year
harrytea / Awesome-Document-Understanding
Document Artificial Intelligence
☆184 Updated 3 months ago
kyegomez / Kosmos2.5
My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"
☆73 Updated 3 weeks ago
furkanbiten / idl_data
OCR Annotations from Amazon Textract for Industry Documents Library
☆103 Updated 2 years ago
clovaai / spade
☆80 Updated 2 years ago