RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
☆39Mar 26, 2025Updated 11 months ago
Alternatives and similar repositories for RoDLA
Users that are interested in RoDLA are comparing it to the libraries listed below
Sorting:
- ☆40Jun 15, 2024Updated last year
- Official repository accompaying the ICDAR 2023 paper☆13Oct 3, 2023Updated 2 years ago
- Github repo for referring atomic video action recognition☆20Oct 2, 2024Updated last year
- [ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation☆13May 13, 2023Updated 2 years ago
- This project aims to generate syntactichandwritten mathematical expression. The dataset is generated from the CROHME 2014 training set.☆14Feb 24, 2022Updated 4 years ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Nov 11, 2022Updated 3 years ago
- Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`☆17Sep 22, 2023Updated 2 years ago
- Datasets and Evaluation Scripts for CompHRDoc☆56Feb 25, 2025Updated last year
- ☆157May 8, 2025Updated 10 months ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 3 years ago
- ☆22Jun 17, 2025Updated 8 months ago
- ☆21Mar 15, 2022Updated 3 years ago
- Official Repository of RefChartQA: Grounding Visual Answer on Chart Images through Instruction Tuning☆14Jul 9, 2025Updated 7 months ago
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation☆75Sep 12, 2024Updated last year
- NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement☆51Aug 5, 2024Updated last year
- [WACV 2025] High-Fidelity Document Stain Removal via A Large-Scale Real-World Dataset and A Memory-Augmented Transformer☆21Jan 14, 2026Updated last month
- MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition☆10Mar 19, 2025Updated 11 months ago
- A curated list of resources on Document Layout Analysis☆11Aug 7, 2025Updated 7 months ago
- TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning☆23Sep 17, 2024Updated last year
- ☆23Dec 12, 2024Updated last year
- [CVPR'24] Handwritten Mathematical Expressions Generation (HMEG)☆30Jun 3, 2024Updated last year
- ☆10Sep 3, 2024Updated last year
- The official project of paper "Visual Text Processing: A Comprehensive Review and Unified Evaluation""☆98Oct 20, 2025Updated 4 months ago
- Unofficial implementation of Towards Accurate Scene Text Recognition with Semantic Reasoning Networks☆28Sep 24, 2021Updated 4 years ago
- Create handwritten word embeddings from a text recognition Seq2Seq system.☆11Dec 1, 2022Updated 3 years ago
- Cross-lingual learning in scene text recognition (ICASSP2024)☆18Sep 29, 2024Updated last year
- Intuitive interface for fine-tuning and retraining a Tesseract OCR language model☆10Jul 4, 2025Updated 8 months ago
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022☆13Nov 25, 2022Updated 3 years ago
- ☆16Jan 30, 2022Updated 4 years ago
- Official code for DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degra…