RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
☆39Mar 26, 2025Updated last year
Alternatives and similar repositories for RoDLA
Users who are interested in RoDLA are comparing it to the libraries listed below.
- ☆41Jun 15, 2024Updated last year
- ☆23Jun 17, 2025Updated 9 months ago
- ☆157May 8, 2025Updated 10 months ago
- ☆10Sep 3, 2024Updated last year
- Official repository for paper "MATERobot: Material Recognition in Wearable Robotics for People with Visual Impairments", ICRA 2024, Best …☆16Mar 26, 2025Updated last year
- [WACV 2025] High-Fidelity Document Stain Removal via A Large-Scale Real-World Dataset and A Memory-Augmented Transformer☆21Jan 14, 2026Updated 2 months ago
- Official repository accompanying the ICDAR 2023 paper☆13Oct 3, 2023Updated 2 years ago
- Datasets and Evaluation Scripts for CompHRDoc☆57Feb 25, 2025Updated last year
- ☆16May 14, 2024Updated last year
- This project aims to generate syntactic handwritten mathematical expressions. The dataset is generated from the CROHME 2014 training set.☆14Feb 24, 2022Updated 4 years ago
- Official Repository of RefChartQA: Grounding Visual Answer on Chart Images through Instruction Tuning☆14Jul 9, 2025Updated 8 months ago
- [ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation☆13May 13, 2023Updated 2 years ago
- Official repository for paper "Scene-agnostic Pose Regression for Visual Localization" (SPR), CVPR 2025☆30Mar 26, 2025Updated last year
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 3 years ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Nov 11, 2022Updated 3 years ago
- A curated list of resources on Document Layout Analysis☆11Aug 7, 2025Updated 7 months ago
- Official repository for paper "Open Panoramic Segmentation" (OPS), ECCV 2024☆33Oct 7, 2025Updated 5 months ago
- ☆21Mar 15, 2022Updated 4 years ago
- [CVPR'24] Handwritten Mathematical Expressions Generation (HMEG)☆31Jun 3, 2024Updated last year
- Official code for DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degra…☆42Mar 20, 2026Updated last week
- Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`☆17Sep 22, 2023Updated 2 years ago
- NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement☆51Aug 5, 2024Updated last year
- MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition☆10Mar 19, 2025Updated last year
- CTE: Contextualized Table Extraction Dataset☆17Feb 23, 2023Updated 3 years ago
- ☆100Aug 1, 2024Updated last year
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition☆28Aug 29, 2023Updated 2 years ago
- Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset☆29Apr 16, 2023Updated 2 years ago
- Code for the paper "UVDoc: Neural Grid-based Document Unwarping"☆202Jul 28, 2024Updated last year
- ☆44Jul 9, 2024Updated last year
- Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.☆18Apr 23, 2023Updated 2 years ago
- IEEE/CVF International Conference on Computer Vision Workshop (2023)☆17Feb 7, 2024Updated 2 years ago
- [AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Lear…☆509Mar 14, 2024Updated 2 years ago
- Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)☆126Nov 13, 2023Updated 2 years ago
- ☆19Aug 13, 2024Updated last year
- The official project of the paper "Visual Text Processing: A Comprehensive Review and Unified Evaluation"☆100Oct 20, 2025Updated 5 months ago
- TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning☆23Sep 17, 2024Updated last year
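- Backend service for an invoice recognition system built with FastAPI, with support for concurrency. An ERFNet model is trained for invoice contour detection, followed by distortion correction, OCR, and template matching; skewed invoices are supported. Accuracy: 99.9%.☆13May 8, 2025Updated 10 months ago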
- Repository for the KVP10k dataset☆22Sep 18, 2025Updated 6 months ago
- The source code for “Homophily-Related: Adaptive Hybrid Graph Filter for Multi-View Graph Clustering”☆10Apr 10, 2024Updated last year