yufanchen96/RoDLA

RoDLA: Benchmarking the Robustness of Document Layout Analysis Models

☆39

Alternatives and similar repositories for RoDLA

Users who are interested in RoDLA are comparing it to the repositories listed below.

johnning2333 / M2Doc
☆43 · Jun 15, 2024 · Updated 2 years ago
KPeng9510 / RAVAR
GitHub repo for referring atomic video action recognition
☆21 · Oct 2, 2024 · Updated last year
HCIILAB / M6Doc
☆166 · May 8, 2025 · Updated last year
JunweiZheng93 / MATERobot
Official repository for paper "MATERobot: Material Recognition in Wearable Robotics for People with Visual Impairments", ICRA 2024, Best …
☆16 · Mar 26, 2025 · Updated last year
FeiT-FeiTeng / OAFuser
☆10 · Sep 3, 2024 · Updated last year
RylonW / DocNLC
Official code for DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degra…
☆44 · Mar 20, 2026 · Updated 4 months ago
KPeng9510 / RelaMiX
☆19 · Aug 13, 2024 · Updated last year
ihdia / seamformer
Official repository accompanying the ICDAR 2023 paper
☆14 · Oct 3, 2023 · Updated 2 years ago
KPeng9510 / OS-SAR
☆16 · May 14, 2024 · Updated 2 years ago
JieHu1996 / DeformableMamba
☆24 · Jun 17, 2025 · Updated last year
SII-sc22mc / DocFusion
A Unified Framework for Document Parsing Tasks (Including Document Layout Analysis, OCR, Formula Recognition, and Table Recognition)
☆15 · Jul 1, 2025 · Updated last year
CXH-Research / StainRestorer
[WACV 2025] High-Fidelity Document Stain Removal via A Large-Scale Real-World Dataset and A Memory-Augmented Transformer
☆23 · Jan 14, 2026 · Updated 6 months ago
thanhnghiadk / syntactic_HME_generation
This project aims to generate syntactic handwritten mathematical expressions. The dataset is generated from the CROHME 2014 training set.
☆14 · Feb 24, 2022 · Updated 4 years ago
felix-schmitt / MathNet
MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition
☆10 · Mar 19, 2025 · Updated last year
moured / RefChartQA
Official Repository of RefChartQA: Grounding Visual Answer on Chart Images through Instruction Tuning
☆14 · Jul 9, 2025 · Updated last year
callsys / FlowText
[ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation
☆13 · May 13, 2023 · Updated 3 years ago
JunweiZheng93 / OPS
Official repository for paper "Open Panoramic Segmentation" (OPS), ECCV 2024
☆37 · Oct 7, 2025 · Updated 9 months ago
microsoft / CompHRDoc
Datasets and Evaluation Scripts for CompHRDoc
☆59 · Feb 25, 2025 · Updated last year
DCGM / SoftCTC
This repository contains the source code for SoftCTC. The original paper can be found here: https://arxiv.org/abs/2212.02135
☆19 · Mar 7, 2023 · Updated 3 years ago
onealwj / MVLT
PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition
☆28 · Nov 11, 2022 · Updated 3 years ago
yufanchen96 / GraphDoc
Graph-based Document Structure Analysis
☆18 · Mar 26, 2025 · Updated last year
qyhou / curated-document-layout-analysis
A curated list of resources on Document Layout Analysis
☆12 · Aug 7, 2025 · Updated 11 months ago
VDIGPKU / STR_TPSearch
☆21 · Mar 15, 2022 · Updated 4 years ago
Royalvice / DocDiff
ACM Multimedia 2023: DocDiff: Document Enhancement via Residual Diffusion Models. Also contains 1597 red seals in Chinese scenes, along w…
☆350 · Aug 22, 2024 · Updated last year
ThunderVVV / RCLSTR
Official PyTorch implementation of `[ACMMM 2023] Relational Contrastive Learning for Scene Text Recognition`
☆17 · Sep 22, 2023 · Updated 2 years ago
ayanban011 / SwinDocSegmenter
[ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation
☆74 · Sep 12, 2024 · Updated last year
ispamm / NAF-DPM
NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement
☆54 · Aug 5, 2024 · Updated last year
UCSB-NLP-Chang / DiffSTE
☆102 · Aug 1, 2024 · Updated last year
AILab-UniFI / cte-dataset
CTE: Contextualized Table Extraction Dataset
☆17 · Feb 23, 2023 · Updated 3 years ago
CaseDrive / publaynet-models
Trained Detectron2 object detection models for document layout analysis based on the PubLayNet dataset
☆28 · Apr 16, 2023 · Updated 3 years ago
HCIILAB / LAST
Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition
☆28 · Aug 29, 2023 · Updated 2 years ago
buaacxf / VIPTR
☆44 · Jul 9, 2024 · Updated 2 years ago
uakarsh / TiLT-Implementation
Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.
☆18 · Apr 23, 2023 · Updated 3 years ago
RuipingL / OpenSU
IEEE/CVF International Conference on Computer Vision Workshop (2023)
☆17 · Feb 7, 2024 · Updated 2 years ago
yeungchenwa / FontDiffuser
[AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Lear…
☆540 · Mar 14, 2024 · Updated 2 years ago
SCUT-DLVCLab / GPT-4V_OCR
Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)
☆128 · Nov 13, 2023 · Updated 2 years ago
omron-sinicx / scipostlayout
☆25 · Jul 31, 2024 · Updated last year
NExTplusplus / TAT-DQA
TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning
☆24 · Sep 17, 2024 · Updated last year
IBM / KVP10k
Repository for the KVP10k dataset
☆23 · Sep 18, 2025 · Updated 10 months ago