☆19Jun 7, 2023Updated 2 years ago
Alternatives and similar repositories for ocr-arxiv-daily
Users that are interested in ocr-arxiv-daily are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆30May 23, 2023Updated 2 years ago
- DocBankLoader is a dataset loader for DocBank, and can convert DocBank to the Object Detection models' format.☆24Mar 17, 2021Updated 5 years ago
- ☆18Apr 11, 2023Updated 3 years ago
- a dataset for camera-based table detection☆16Jul 30, 2021Updated 4 years ago
- CTE: Contextualized Table Extraction Dataset☆17Feb 23, 2023Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Repository to use/train segmentation models for document layout analysis☆19Jan 13, 2022Updated 4 years ago
- STIRER: A Unified Model for Low-Resolution Scene Text Image Recovery and Recognition -- ACMMM 2023☆14Dec 2, 2024Updated last year
- Dataset and scripts for HRDoc☆41Jun 21, 2023Updated 2 years ago
- ☆16Jan 30, 2022Updated 4 years ago
- WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia☆32Jun 12, 2025Updated 10 months ago
- ☆14May 26, 2023Updated 2 years ago
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆18Dec 6, 2022Updated 3 years ago
- Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.☆18Apr 23, 2023Updated 3 years ago
- Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer☆62Jan 11, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 3 years ago
- ☆22May 5, 2021Updated 5 years ago
- A large scale camera-taken table detection and recognition dataset.☆149Apr 9, 2026Updated last month
- Locality-Aware Non-Maximum Suppression (C++ version)☆23Aug 31, 2021Updated 4 years ago
- Example codebase for fine-tuning layoutLMv3 on DocVQA☆53Sep 19, 2022Updated 3 years ago
- Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes☆528Jul 20, 2025Updated 9 months ago
- convert pytorch trained yolo model to ncnn for Flexible deployment☆10Aug 30, 2018Updated 7 years ago
- Algorithms, papers, datasets, performance comparisons for Document AI.☆208Mar 1, 2025Updated last year
- Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR☆80Dec 2, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Official Code for 'EPiDA: An Easy Plug-in Data Augmentation Framework for High Performance Text Classification' - NAACL 2022☆23May 9, 2022Updated 4 years ago
- OCR Annotations from Amazon Textract for Industry Documents Library☆103Aug 20, 2022Updated 3 years ago
- Simplified implementations of deep learning related works☆13Oct 6, 2016Updated 9 years ago
- Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…☆364Oct 31, 2022Updated 3 years ago
- Repository in Support of EAGLE Submission☆23Oct 11, 2025Updated 6 months ago
- [AAAI 26 Demo] Offical repo for CAT-V - Caption Anything in Video: Object-centric Dense Video Captioning with Spatiotemporal Multimodal P…☆65Jan 27, 2026Updated 3 months ago
- ☆14Nov 15, 2016Updated 9 years ago
- Tool to parse wiki tables from the HTML dump of Wikipedia☆11Jun 12, 2022Updated 3 years ago
- Code of "Incorporating long-range consistency in CNN-based texture generation"☆13Jan 12, 2017Updated 9 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild☆33Apr 16, 2024Updated 2 years ago
- ☆13Nov 8, 2022Updated 3 years ago
- ☆19Jul 7, 2025Updated 10 months ago
- Image-based table cell detection: a new dataset and an improved detection method.☆55Jul 2, 2020Updated 5 years ago
- Implementation of clDice - a Novel Connectivity-Preserving Loss Function for Vessel Segmentation (2019) in Keras/Tensorflow☆13Apr 22, 2020Updated 6 years ago
- Learning Imbalanced Datasets With Maximum Margin Losss☆12Jun 17, 2023Updated 2 years ago
- 3D Slicer extension for SegmentAnyBone developed by Mazurowski Lab☆15Feb 25, 2026Updated 2 months ago