WenjinW/LATIN-Prompt

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/WenjinW/LATIN-Prompt)

WenjinW / LATIN-Prompt

☆52

Alternatives and similar repositories for LATIN-Prompt

Users that are interested in LATIN-Prompt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

due-benchmark / baselines
View on GitHub
The code related to the baselines from NeurIPS 2021 paper "DUE: End-to-End Document Understanding Benchmark."
☆36Mar 2, 2023Updated 3 years ago
herobd / dessurt
View on GitHub
Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer
☆62Jan 11, 2023Updated 3 years ago
ZZR8066 / GraphDoc
View on GitHub
☆45Jul 18, 2022Updated 4 years ago
huggingface / docmatix
View on GitHub
A huge dataset for Document Visual Question Answering
☆24Jul 29, 2024Updated last year
littletomatodonkey / Augment-XY-CUT
View on GitHub
an unofficial code for augment-XY-CUT in XYLayoutLM
☆30Jul 12, 2022Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
clovaai / webvicob
View on GitHub
Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023
☆110Oct 24, 2023Updated 2 years ago
rubenpt91 / MP-DocVQA-Framework
View on GitHub
☆72Jan 9, 2024Updated 2 years ago
mineshmathew / DocVQA
View on GitHub
baselines for DocVQA dataset
☆21Apr 11, 2021Updated 5 years ago
usydnlp / vdoc
View on GitHub
☆15Sep 7, 2022Updated 3 years ago
naver-ai / cream
View on GitHub
Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023
☆46Jun 11, 2024Updated 2 years ago
DS3Lab / TableParser
View on GitHub
Repo for "TableParser: Automatic Table Parsing with Weak Supervision from Spreadsheets" at SDU@AAAI-22
☆15Aug 3, 2023Updated 2 years ago
clovaai / bros
View on GitHub
☆163Dec 27, 2022Updated 3 years ago
SCUT-DLVCLab / Document-AI-Recommendations
View on GitHub
Algorithms, papers, datasets, performance comparisons for Document AI.
☆209Mar 1, 2025Updated last year
lulia0228 / Document_IE
View on GitHub
GCN use for semi-construct document information extraction.
☆21Aug 5, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
kyegomez / Kosmos2.5
View on GitHub
My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"
☆75Updated this week
google-research / pix2struct
View on GitHub
☆686Jul 8, 2026Updated 2 weeks ago
VinhLoiIT / vietnamese-htr
View on GitHub
Vietnamese handwritten text recognition system
☆18May 2, 2021Updated 5 years ago
NExTplusplus / TAT-DQA
View on GitHub
TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning
☆24Sep 17, 2024Updated last year
amazon-science / textadain-robust-recognition
View on GitHub
TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers
☆21Jul 26, 2022Updated 4 years ago
jpWang / LiLT
View on GitHub
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…
☆366Oct 31, 2022Updated 3 years ago
allanj / LayoutLMv3-DocVQA
View on GitHub
Example codebase for fine-tuning layoutLMv3 on DocVQA
☆53Sep 19, 2022Updated 3 years ago
uakarsh / latr
View on GitHub
Implementation of LaTr: Layout-aware transformer for scene-text VQA,a novel multimodal architecture for Scene Text Visual Question Answer…
☆56Updated this week
jfma-USTC / HRDoc
View on GitHub
Dataset and scripts for HRDoc
☆42Jun 21, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
luochuwei / Variational_method
View on GitHub
python code for my variational RNN method
☆14Jul 28, 2016Updated 9 years ago
LukeForeverYoung / UReader
View on GitHub
☆142Feb 13, 2024Updated 2 years ago
salesforce / QVR-SimpleDLM
View on GitHub
Pytorch Implementation of Value Retrieval with Arbitrary Queries for Form-like Documents.
☆16May 1, 2025Updated last year
SCUT-DLVCLab / GPT-4V_OCR
View on GitHub
Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)
☆128Nov 13, 2023Updated 2 years ago
NormXU / ERNIE-Layout-Pytorch
View on GitHub
An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP.
☆107Nov 15, 2023Updated 2 years ago
nttmdlab-nlp / SlideVQA
View on GitHub
SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)
☆106Mar 31, 2025Updated last year
HCIILAB / M5HisDoc
View on GitHub
☆34Dec 18, 2025Updated 7 months ago
felix-schmitt / MathNet
View on GitHub
MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition
☆10Mar 19, 2025Updated last year
obensch / DocumentInformationExtraction
View on GitHub
Key Information Extraction From Documents: Evaluation And Generator
☆20Mar 10, 2021Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
HCIILAB / M6Doc
View on GitHub
☆164May 8, 2025Updated last year
furkanbiten / idl_data
View on GitHub
OCR Annotations from Amazon Textract for Industry Documents Library
☆103Aug 20, 2022Updated 3 years ago
uakarsh / TiLT-Implementation
View on GitHub
Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.
☆18Apr 23, 2023Updated 3 years ago
xinke-wang / Awesome-Text-VQA
View on GitHub
☆188May 8, 2024Updated 2 years ago
anisha2102 / docvqa
View on GitHub
Document Visual Question Answering
☆130Jul 30, 2020Updated 5 years ago
clovaai / cord
View on GitHub
CORD: A Consolidated Receipt Dataset for Post-OCR Parsing
☆485Jul 20, 2022Updated 4 years ago
herobd / FUDGE
View on GitHub
Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"
☆33Mar 4, 2022Updated 4 years ago