Datasets and Evaluation Scripts for CompHRDoc
☆58Feb 25, 2025Updated last year
Alternatives and similar repositories for CompHRDoc
Users that are interested in CompHRDoc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repo is used to release the ArxivFormula dataset.☆35Nov 12, 2024Updated last year
- ☆42Jun 15, 2024Updated last year
- ☆161May 8, 2025Updated last year
- ☆37Jan 26, 2026Updated 3 months ago
- A full codebase for replicating the results of Nougat from downloading arXiv dataset to the final evaluation. It also contains a few fixe…☆11Dec 11, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official repository accompaying the ICDAR 2023 paper☆13Oct 3, 2023Updated 2 years ago
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models☆39Mar 26, 2025Updated last year
- ICDAR 2024 Table OCR Model☆39Feb 25, 2026Updated 2 months ago
- A curated collection of projects, benchmarks, and research papers focused on reproducing and advancing the DeepSeek R1 framework.☆15Mar 19, 2025Updated last year
- The official implement of CTRNet++.☆15Dec 30, 2024Updated last year
- This is the official repository of the revised datasets FUNSD-r and CORD-r, introduced in EMNLP 2023 paper Reading Order Matters: Informa…☆17Mar 20, 2024Updated 2 years ago
- ☆142Feb 13, 2024Updated 2 years ago
- TRACE: Table Reconstruction Aligned to Corner and Edges (ICDAR 2023)☆31Mar 13, 2024Updated 2 years ago
- [AAAI 2025 (Oral)] SAIL: Sample-Centric In-Context Learning for Document Information Extraction☆20Dec 24, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- This is the official repository of the EMNLP 2023 paper Reading Order Matters: Information Extraction from Visually-rich Documents by Tok…☆18Mar 15, 2024Updated 2 years ago
- ☆25Oct 20, 2022Updated 3 years ago
- ☆15Nov 14, 2022Updated 3 years ago
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,828Mar 17, 2026Updated 2 months ago
- Codebase for fine-tuning / evaluating nougat-based image2latex generation models☆160Sep 25, 2024Updated last year
- [ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)☆43Oct 6, 2023Updated 2 years ago
- ☆42Sep 2, 2023Updated 2 years ago
- ☆32Apr 14, 2024Updated 2 years ago
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆2,152Apr 14, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models☆154Jan 13, 2025Updated last year
- [ICDAR 2024] (Best Student Paper🏆) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation☆15Sep 6, 2024Updated last year
- Algorithms, papers, datasets, performance comparisons for Document AI.☆208Mar 1, 2025Updated last year
- This is an accurate implementation for IoU loss between two rotated polygons. This algorithm is accurate and differential, but there is n…☆18Mar 5, 2022Updated 4 years ago
- ☆14Sep 6, 2024Updated last year
- Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.☆18Apr 23, 2023Updated 3 years ago
- Repository for NLP project. Name to be changed when we decide on a project☆16Apr 19, 2022Updated 4 years ago
- Tensorflow implementation of denseness☆14Mar 2, 2019Updated 7 years ago
- ☆20Apr 8, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆320Aug 15, 2025Updated 9 months ago
- Code for ICCV 2023 Paper : “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”☆54Aug 8, 2023Updated 2 years ago
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis☆430Feb 1, 2023Updated 3 years ago
- ☆102Dec 23, 2024Updated last year
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆117Aug 26, 2024Updated last year
- [Neural Networks 2025] The official code for the paper "MNet: A Multi-Scale Network for Visible Watermark Removal."☆17Jun 16, 2025Updated 11 months ago
- AdaptKeyBERT: keyword/keyphrase extraction with zero-shot and few-shot semi-supervised domain adaptation☆26Sep 22, 2024Updated last year