wanghaisheng / ocr-arxiv-dailyView external linksLinks
☆18Jun 7, 2023Updated 2 years ago
Alternatives and similar repositories for ocr-arxiv-daily
Users that are interested in ocr-arxiv-daily are comparing it to the libraries listed below
Sorting:
- ☆10May 25, 2022Updated 3 years ago
- DocBankLoader is a dataset loader for DocBank, and can convert DocBank to the Object Detection models' format.☆25Mar 17, 2021Updated 4 years ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆30May 23, 2023Updated 2 years ago
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"☆33Mar 4, 2022Updated 3 years ago
- ☆14May 26, 2023Updated 2 years ago
- STIRER: A Unified Model for Low-Resolution Scene Text Image Recovery and Recognition -- ACMMM 2023☆14Dec 2, 2024Updated last year
- ShabbyPages is a state-of-the-art corpus of born-digital document images with both ground truth and distorted versions appropriate for us…☆62Mar 12, 2025Updated 11 months ago
- WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia☆32Jun 12, 2025Updated 8 months ago
- Dataset and scripts for HRDoc☆41Jun 21, 2023Updated 2 years ago
- a dataset for camera-based table detection☆16Jul 30, 2021Updated 4 years ago
- Repository to use/train segmentation models for document layout analysis☆19Jan 13, 2022Updated 4 years ago
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆18Dec 6, 2022Updated 3 years ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 2 years ago
- Locality-Aware Non-Maximum Suppression (C++ version)☆23Aug 31, 2021Updated 4 years ago
- ☆21Mar 15, 2022Updated 3 years ago
- ☆18Apr 11, 2023Updated 2 years ago
- A large scale camera-taken table detection and recognition dataset.☆149Jul 21, 2025Updated 6 months ago
- Ideographic Description Sequence Checker Tools☆25Jun 21, 2017Updated 8 years ago
- Official Code for 'EPiDA: An Easy Plug-in Data Augmentation Framework for High Performance Text Classification' - NAACL 2022☆23May 9, 2022Updated 3 years ago
- Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer☆62Jan 11, 2023Updated 3 years ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Nov 11, 2022Updated 3 years ago
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition☆28Aug 29, 2023Updated 2 years ago
- A python implementation to extract data in structured form from an image of an invoice☆30Sep 7, 2020Updated 5 years ago
- Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…☆361Oct 31, 2022Updated 3 years ago
- BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild☆33Apr 16, 2024Updated last year
- The Learnable Typewriter: A Generative Approach to Text Line Analysis☆34Oct 31, 2024Updated last year
- Pytorch implementation for "Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter".☆67Jun 15, 2021Updated 4 years ago
- ☆38Feb 4, 2023Updated 3 years ago
- Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR☆80Dec 2, 2022Updated 3 years ago
- Code for our ACL2021 paper: "Check It Again: Progressive Visual Question Answering via Visual Entailment"☆31Nov 24, 2021Updated 4 years ago
- we explores the fascinating domain of text-to-image generation using the powerful capabilities of the Flux API. The objective is to trans…☆12Aug 14, 2024Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆12Nov 14, 2025Updated 3 months ago
- Vision-Language Models Toolbox: Your all-in-one solution for multimodal research and experimentation☆12Feb 16, 2025Updated last year
- Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes☆503Jul 20, 2025Updated 6 months ago
- The dataset used in the CVPR 2022 paper (SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Norm…☆34Jun 21, 2022Updated 3 years ago
- Project that regroup the state-of-the-art knowledge distillation approaches for unsupervised anomaly detection☆13Oct 10, 2025Updated 4 months ago
- ☆22Dec 23, 2025Updated last month
- PERT: A Progressively Region-based Network for Scene Text Removal (TIP2023)☆37Aug 11, 2023Updated 2 years ago
- ☆22Dec 11, 2025Updated 2 months ago