uakarsh/TiLT-Implementation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/uakarsh/TiLT-Implementation)

uakarsh / TiLT-Implementation

Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.

☆18

Alternatives and similar repositories for TiLT-Implementation

Users that are interested in TiLT-Implementation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

allanj / LayoutLMv3-DocVQA
View on GitHub
Example codebase for fine-tuning layoutLMv3 on DocVQA
☆53Sep 19, 2022Updated 3 years ago
SCUT-DLVCLab / Document-AI-Recommendations
View on GitHub
Algorithms, papers, datasets, performance comparisons for Document AI.
☆209Mar 1, 2025Updated last year
IBM / KVP10k
View on GitHub
Repository for the KVP10k dataset
☆23Sep 18, 2025Updated 10 months ago
rubenpt91 / MP-DocVQA-Framework
View on GitHub
☆72Jan 9, 2024Updated 2 years ago
saifullah3396 / docxclassifier
View on GitHub
☆17Jul 11, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
AILab-UniFI / cte-dataset
View on GitHub
CTE: Contextualized Table Extraction Dataset
☆17Feb 23, 2023Updated 3 years ago
hint-lab / doctrack
View on GitHub
Dataset for EMNLP'23 Paper "DocTrack: A Visually-Rich Document Dataset Really Aligned with Human Eye Movement for Machine Reading"
☆11Oct 25, 2023Updated 2 years ago
wanghaisheng / ocr-arxiv-daily
View on GitHub
☆19Jun 7, 2023Updated 3 years ago
shabie / docformer
View on GitHub
Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…
☆290Feb 13, 2023Updated 3 years ago
LukeForeverYoung / UReader
View on GitHub
☆142Feb 13, 2024Updated 2 years ago
A-Ijishakin / Contrast-DiffAE
View on GitHub
☆15Aug 8, 2023Updated 2 years ago
nttmdlab-nlp / SlideVQA
View on GitHub
SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)
☆106Mar 31, 2025Updated last year
ZZR8066 / GraphDoc
View on GitHub
☆45Jul 18, 2022Updated 4 years ago
MapariAbdullah / Llama2-Custom-document-QA
View on GitHub
Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer
☆15Oct 5, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
yufanchen96 / RoDLA
View on GitHub
RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
☆39Mar 26, 2025Updated last year
abdoelsayed2016 / TNCR_Dataset
View on GitHub
Deep learning, Convolutional neural networks, Image processing, Document processing, Table detection, Page object detection, Table classi…
☆68Feb 24, 2024Updated 2 years ago
uakarsh / docformerv2
View on GitHub
This repo consists of my implementation of DocFormerV2
☆12Mar 31, 2024Updated 2 years ago
uakarsh / latr
View on GitHub
Implementation of LaTr: Layout-aware transformer for scene-text VQA,a novel multimodal architecture for Scene Text Visual Question Answer…
☆56Oct 30, 2024Updated last year
microsoft / CompHRDoc
View on GitHub
Datasets and Evaluation Scripts for CompHRDoc
☆59Feb 25, 2025Updated last year
jfkuang / CFAM
View on GitHub
Contrast-guided Feature Adjustment Module for Visual Information Extraction
☆30May 23, 2023Updated 3 years ago
allenai / mmda
View on GitHub
multimodal document analysis
☆165May 14, 2026Updated 2 months ago
nilesc / Long-Structured-Debate-Generation-and-Evaluation
View on GitHub
☆13Dec 8, 2022Updated 3 years ago
L597383845 / row-col-table-recognition
View on GitHub
time-series row column classification
☆14Jan 7, 2022Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
IvanVassi / LaneNet-with-homography
View on GitHub
LaneNet with homography prediction pytorch implementation
☆10Feb 23, 2022Updated 4 years ago
qhnhynmm / ViOCRVQA-Dataset
View on GitHub
The largest VQA dataset for Vietnamese. Related to the text content in the image.
☆19Apr 9, 2025Updated last year
NExTplusplus / TAT-DQA
View on GitHub
TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning
☆24Sep 17, 2024Updated last year
deepopinion / anls_star_metric
View on GitHub
Official implementation of the ANLS* metric
☆25Jul 13, 2026Updated last week
SiddharthRajpal / HealthVision
View on GitHub
This project uses deep learning algorithms and the Keras library to determine if a person has certain diseases or not from their chest x-…
☆10Nov 18, 2025Updated 8 months ago
DS3Lab / TableParser
View on GitHub
Repo for "TableParser: Automatic Table Parsing with Weak Supervision from Spreadsheets" at SDU@AAAI-22
☆15Aug 3, 2023Updated 2 years ago
showlab / assistgui
View on GitHub
☆30Apr 16, 2024Updated 2 years ago
WenjinW / LATIN-Prompt
View on GitHub
☆52May 28, 2024Updated 2 years ago
MAEHCM / ICL-D3IE
View on GitHub
Code for ICCV 2023 Paper : “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”
☆54Aug 8, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
herobd / dessurt
View on GitHub
Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer
☆62Jan 11, 2023Updated 3 years ago
Shreeshrii / ocr-evaluation-tools
View on GitHub
☆16Mar 24, 2021Updated 5 years ago
chongyangtao / LLMs-for-NLG-Evaluation
View on GitHub
Awesome LLM for NLG Evaluation Papers
☆26Jan 23, 2024Updated 2 years ago
abdoelsayed2016 / Table-Detection-Structure-Recognition
View on GitHub
https://dl.acm.org/doi/10.1145/3657281
☆97Apr 25, 2024Updated 2 years ago
IDLabMedia / mvs-splatting
View on GitHub
Uses MVS depth estimation to initialize the 3D Gaussian splats instead of the Colmap SfM sparse point cloud.
☆15Nov 27, 2025Updated 7 months ago
doc-analysis / ReadingBank
View on GitHub
ReadingBank: A Benchmark Dataset for Reading Order Detection
☆117Aug 26, 2024Updated last year
HaoranREN / TensorFlow_Model_Quantization
View on GitHub
A tutorial of model quantization using TensorFlow
☆11Aug 2, 2021Updated 4 years ago