LegalDocumentProcessing / FIR_Dataset_ICDAR2023Links
☆11Updated 2 years ago
Alternatives and similar repositories for FIR_Dataset_ICDAR2023
Users that are interested in FIR_Dataset_ICDAR2023 are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of STR models for transfer learning in Indic Languages☆16Updated 4 years ago
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers☆21Updated 3 years ago
- Official repository accompaying the ICDAR 2023 paper☆12Updated 2 years ago
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆21Updated 8 months ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆30Updated 2 years ago
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆18Updated 3 years ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Updated 3 years ago
- Official code for the paper: "Perception and Semantic Aware Regularization for Sequential Confidence Calibration (CVPR2023)"☆10Updated last year
- A handwritten Chemical Structure Image data set named EDU-CHEMC, which consists of totally 52,987 handwritten molecular structure images …☆14Updated 7 months ago
- ☆45Updated 3 years ago
- [ACL 2025 🔥] A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding☆60Updated 7 months ago
- ☆18Updated 2 years ago
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022☆13Updated 3 years ago
- ☆17Updated 4 years ago
- Synthetic identity documents dataset☆30Updated 9 months ago
- An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents☆37Updated 2 years ago
- ☆16Updated 3 years ago
- Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`☆17Updated 2 years ago
- WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia☆32Updated 6 months ago
- This repository is created to share current progress of transformer based optical character recognition(OCR). Welcome to share~☆55Updated 2 years ago
- For ICDAR 2019 Paper on End-to-end License Plate and Scene Text Recognition with multi-head attention models☆25Updated 4 years ago
- Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR☆80Updated 3 years ago
- This project aims to generate syntactichandwritten mathematical expression. The dataset is generated from the CROHME 2014 training set.☆14Updated 3 years ago
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding☆22Updated last week
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation☆76Updated last year
- Directed masked autoencoders☆14Updated 2 years ago
- Official PyTorch implementation of RIO☆19Updated 4 years ago
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆16Updated last year
- A comprehensive list [Hi-SAM@TPAMI'24, GoMatching@NeurIPS'24, DeepSolo(++)@ CVPR'23, DPText-DETR@AAAI'23, I3CL@IJCV'22] of our research w…☆92Updated last year
- Download flickr8k, flickr30k image caption datasets☆35Updated last year