yufanchen96 / RoDLAView external linksLinks
RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
☆38Mar 26, 2025Updated 10 months ago
Alternatives and similar repositories for RoDLA
Users that are interested in RoDLA are comparing it to the libraries listed below
Sorting:
- ☆40Jun 15, 2024Updated last year
- Official repository accompaying the ICDAR 2023 paper☆13Oct 3, 2023Updated 2 years ago
- Github repo for referring atomic video action recognition☆20Oct 2, 2024Updated last year
- This project aims to generate syntactichandwritten mathematical expression. The dataset is generated from the CROHME 2014 training set.☆14Feb 24, 2022Updated 3 years ago
- [ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation☆13May 13, 2023Updated 2 years ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Nov 11, 2022Updated 3 years ago
- Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`☆17Sep 22, 2023Updated 2 years ago
- Datasets and Evaluation Scripts for CompHRDoc☆56Feb 25, 2025Updated 11 months ago
- ☆156May 8, 2025Updated 9 months ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 2 years ago
- ☆22Jun 17, 2025Updated 7 months ago
- ☆21Mar 15, 2022Updated 3 years ago
- Official Repository of RefChartQA: Grounding Visual Answer on Chart Images through Instruction Tuning☆14Jul 9, 2025Updated 7 months ago
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation☆76Sep 12, 2024Updated last year
- MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition☆10Mar 19, 2025Updated 10 months ago
- NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement☆51Aug 5, 2024Updated last year
- [WACV 2025] High-Fidelity Document Stain Removal via A Large-Scale Real-World Dataset and A Memory-Augmented Transformer☆18Jan 14, 2026Updated last month
- A curated list of resources on Document Layout Analysis☆11Aug 7, 2025Updated 6 months ago
- [CVPR'24] Handwritten Mathematical Expressions Generation (HMEG)☆29Jun 3, 2024Updated last year
- ☆23Dec 12, 2024Updated last year
- TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning☆23Sep 17, 2024Updated last year
- ☆10Sep 3, 2024Updated last year
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition☆28Aug 29, 2023Updated 2 years ago
- The official project of paper "Visual Text Processing: A Comprehensive Review and Unified Evaluation""☆96Oct 20, 2025Updated 3 months ago
- Unofficial implementation of Towards Accurate Scene Text Recognition with Semantic Reasoning Networks☆28Sep 24, 2021Updated 4 years ago
- ☆16Jan 30, 2022Updated 4 years ago
- Cross-lingual learning in scene text recognition (ICASSP2024)☆18Sep 29, 2024Updated last year
- Create handwritten word embeddings from a text recognition Seq2Seq system.☆10Dec 1, 2022Updated 3 years ago
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022☆13Nov 25, 2022Updated 3 years ago
- Intuitive interface for fine-tuning and retraining a Tesseract OCR language model☆10Jul 4, 2025Updated 7 months ago
- Official code for DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degra…☆39May 28, 2025Updated 8 months ago
- Official repository for paper "MATERobot: Material Recognition in Wearable Robotics for People with Visual Impairments", ICRA 2024, Best …☆16Mar 26, 2025Updated 10 months ago
- An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR☆15Dec 4, 2021Updated 4 years ago
- ☆17Jul 9, 2024Updated last year
- ACM Multimedia 2023: DocDiff: Document Enhancement via Residual Diffusion Models. Also contains 1597 red seals in Chinese scenes, along w…☆331Aug 22, 2024Updated last year
- CTE: Contextualized Table Extraction Dataset☆17Feb 23, 2023Updated 2 years ago
- Official implementation of URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding (AAAI 2026…☆35Feb 4, 2026Updated last week
- The source codes of TDv2 in paper: TDv2: A Novel Tree-Structured Decoder for Offline Mathematical Expression Recognition.☆12Jul 28, 2022Updated 3 years ago
- ☆42Feb 7, 2023Updated 3 years ago