Datasets and Evaluation Scripts for CompHRDoc
☆57Feb 25, 2025Updated last year
Alternatives and similar repositories for CompHRDoc
Users that are interested in CompHRDoc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Dataset and scripts for HRDoc☆41Jun 21, 2023Updated 2 years ago
- This repo is used to release the ArxivFormula dataset.☆35Nov 12, 2024Updated last year
- ☆41Jun 15, 2024Updated last year
- ☆37Jan 26, 2026Updated 2 months ago
- A full codebase for replicating the results of Nougat from downloading arXiv dataset to the final evaluation. It also contains a few fixe…☆11Dec 11, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Official repository accompaying the ICDAR 2023 paper☆13Oct 3, 2023Updated 2 years ago
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models☆39Mar 26, 2025Updated last year
- ICDAR 2024 Table OCR Model☆39Feb 25, 2026Updated last month
- A curated collection of projects, benchmarks, and research papers focused on reproducing and advancing the DeepSeek R1 framework.☆15Mar 19, 2025Updated last year
- The official implement of CTRNet++.☆15Dec 30, 2024Updated last year
- This is the official repository of the revised datasets FUNSD-r and CORD-r, introduced in EMNLP 2023 paper Reading Order Matters: Informa…☆17Mar 20, 2024Updated 2 years ago
- ☆142Feb 13, 2024Updated 2 years ago
- [AAAI 2025 (Oral)] SAIL: Sample-Centric In-Context Learning for Document Information Extraction☆19Dec 24, 2024Updated last year
- ☆25Oct 20, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆15Nov 14, 2022Updated 3 years ago
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,825Mar 17, 2026Updated 3 weeks ago
- Codebase for fine-tuning / evaluating nougat-based image2latex generation models☆160Sep 25, 2024Updated last year
- [ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)☆42Oct 6, 2023Updated 2 years ago
- ☆42Sep 2, 2023Updated 2 years ago
- ☆32Apr 14, 2024Updated last year
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆2,085Apr 14, 2025Updated 11 months ago
- DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models☆153Jan 13, 2025Updated last year
- [ICDAR 2024] (Best Student Paper🏆) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation☆15Sep 6, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Algorithms, papers, datasets, performance comparisons for Document AI.☆206Mar 1, 2025Updated last year
- This is an accurate implementation for IoU loss between two rotated polygons. This algorithm is accurate and differential, but there is n…☆18Mar 5, 2022Updated 4 years ago
- ☆14Sep 6, 2024Updated last year
- Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.☆18Apr 23, 2023Updated 2 years ago
- Repository for NLP project. Name to be changed when we decide on a project☆16Apr 19, 2022Updated 3 years ago
- ☆20Apr 8, 2025Updated last year
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆320Aug 15, 2025Updated 7 months ago
- Code for ICCV 2023 Paper : “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”☆54Aug 8, 2023Updated 2 years ago
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis☆428Feb 1, 2023Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆102Dec 23, 2024Updated last year
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆117Aug 26, 2024Updated last year
- AdaptKeyBERT: keyword/keyphrase extraction with zero-shot and few-shot semi-supervised domain adaptation☆26Sep 22, 2024Updated last year
- Official implementation of URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding (AAAI 2026…☆37Feb 4, 2026Updated 2 months ago
- ☆19Sep 1, 2022Updated 3 years ago
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022☆13Nov 25, 2022Updated 3 years ago
- A High-efficiency Open-source Toolkit for Table-to-Latex Task☆275Dec 6, 2025Updated 4 months ago