Datasets and Evaluation Scripts for CompHRDoc
☆57Feb 25, 2025Updated last year
Alternatives and similar repositories for CompHRDoc
Users that are interested in CompHRDoc are comparing it to the libraries listed below
Sorting:
- Dataset and scripts for HRDoc☆41Jun 21, 2023Updated 2 years ago
- ☆41Jun 15, 2024Updated last year
- ☆157May 8, 2025Updated 10 months ago
- ☆37Jan 26, 2026Updated last month
- A full codebase for replicating the results of Nougat from downloading arXiv dataset to the final evaluation. It also contains a few fixe…☆11Dec 11, 2023Updated 2 years ago
- Official repository accompaying the ICDAR 2023 paper☆13Oct 3, 2023Updated 2 years ago
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models☆39Mar 26, 2025Updated 11 months ago
- The official implement of CTRNet++.☆14Dec 30, 2024Updated last year
- This is the official repository of the revised datasets FUNSD-r and CORD-r, introduced in EMNLP 2023 paper Reading Order Matters: Informa…☆17Mar 20, 2024Updated 2 years ago
- ☆143Feb 13, 2024Updated 2 years ago
- TRACE: Table Reconstruction Aligned to Corner and Edges (ICDAR 2023)☆31Mar 13, 2024Updated 2 years ago
- [AAAI 2025 (Oral)] SAIL: Sample-Centric In-Context Learning for Document Information Extraction☆19Dec 24, 2024Updated last year
- ☆25Oct 20, 2022Updated 3 years ago
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,824Apr 9, 2025Updated 11 months ago
- Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023☆46Jun 11, 2024Updated last year
- Codebase for fine-tuning / evaluating nougat-based image2latex generation models☆160Sep 25, 2024Updated last year
- ☆42Sep 2, 2023Updated 2 years ago
- [ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)☆42Oct 6, 2023Updated 2 years ago
- ☆32Apr 14, 2024Updated last year
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆2,048Apr 14, 2025Updated 11 months ago
- DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models☆152Jan 13, 2025Updated last year
- [ICDAR 2024] (Best Student Paper🏆) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation☆15Sep 6, 2024Updated last year
- Algorithms, papers, datasets, performance comparisons for Document AI.☆205Mar 1, 2025Updated last year
- This is an accurate implementation for IoU loss between two rotated polygons. This algorithm is accurate and differential, but there is n…☆18Mar 5, 2022Updated 4 years ago
- ☆14Sep 6, 2024Updated last year
- Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.☆18Apr 23, 2023Updated 2 years ago
- Repository for NLP project. Name to be changed when we decide on a project☆16Apr 19, 2022Updated 3 years ago
- Tensorflow implementation of denseness☆14Mar 2, 2019Updated 7 years ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆315Aug 15, 2025Updated 7 months ago
- Code for ICCV 2023 Paper : “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”☆54Aug 8, 2023Updated 2 years ago
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis☆423Feb 1, 2023Updated 3 years ago
- ☆102Dec 23, 2024Updated last year
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆117Aug 26, 2024Updated last year
- AdaptKeyBERT: keyword/keyphrase extraction with zero-shot and few-shot semi-supervised domain adaptation☆25Sep 22, 2024Updated last year
- Official implementation of URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding (AAAI 2026…☆38Feb 4, 2026Updated last month
- ☆21Mar 15, 2022Updated 4 years ago
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022☆13Nov 25, 2022Updated 3 years ago
- The official repo for the technical report "Scalable Mask Annotation for Video Text Spotting"☆16May 3, 2023Updated 2 years ago
- Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator☆12Apr 28, 2024Updated last year