LegalDocumentProcessing / FIR_Dataset_ICDAR2023
☆10Updated 2 years ago
Alternatives and similar repositories for FIR_Dataset_ICDAR2023
Users who are interested in FIR_Dataset_ICDAR2023 are comparing it to the repositories listed below
- PyTorch implementation of STR models for transfer learning in Indic Languages☆16Updated 4 years ago
- Official repository accompanying the ICDAR 2023 paper☆12Updated 2 years ago
- Synthetic identity documents dataset☆28Updated 7 months ago
- Implementation of the DocLLM paper for Llama models.☆13Updated 6 months ago
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆21Updated 6 months ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆30Updated 2 years ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Updated 2 years ago
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers☆21Updated 3 years ago
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆15Updated 11 months ago
- An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents☆37Updated 2 years ago
- A handwritten chemical structure image dataset named EDU-CHEMC, which consists of a total of 52,987 handwritten molecular structure images …☆14Updated 5 months ago
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆18Updated 2 years ago
- ☆18Updated 2 years ago
- Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION☆79Updated 2 years ago
- This repository is created to share current progress of transformer based optical character recognition(OCR). Welcome to share~☆55Updated 2 years ago
- Download flickr8k, flickr30k image caption datasets☆30Updated last year
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding☆22Updated 7 months ago
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆81Updated 2 years ago
- ☆45Updated 3 years ago
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022☆13Updated 2 years ago
- Large-Scale Scene Text Dataset for Indic Languages☆18Updated 3 weeks ago
- Official code for the paper: "Perception and Semantic Aware Regularization for Sequential Confidence Calibration (CVPR2023)"☆10Updated last year
- This project was developed during 24hr Hackathon - Unscript 2k19. It is a service as telegram bot that takes infected skin image as input…☆16Updated 5 years ago
- ☆23Updated 10 months ago
- Library for converting from RGB / GrayScale image to base64 and back.☆19Updated 3 years ago
- Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`☆17Updated 2 years ago
- This project aims to generate syntactic handwritten mathematical expressions. The dataset is generated from the CROHME 2014 training set.☆14Updated 3 years ago
- ☆26Updated last year
- WikiTableSet: The largest publicly available image-based table recognition dataset in three languages built from Wikipedia☆31Updated 4 months ago
- Attention-based sequence-to-sequence model for handwritten word recognition☆62Updated last year