AI-Application-and-Integration-Lab / Scene-Text-Detection-And-Recognition-Model_M503
☆13Updated last year
Alternatives and similar repositories for Scene-Text-Detection-And-Recognition-Model_M503
Users who are interested in Scene-Text-Detection-And-Recognition-Model_M503 are comparing it to the repositories listed below.
- ☆13Updated 2 years ago
- [ICIP 2024] Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model☆16Updated last month
- Scene-Text-Detection-And-Recognition-Model_M504☆25Updated 10 months ago
- [ICASSP 2025] PDSeg: Patch-Wise Distillation and Controllable Image Generation for Weakly-Supervised Histopathology Tissue Segmentation☆17Updated last month
- TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning☆23Updated 9 months ago
- ☆136Updated last year
- Document Artificial Intelligence☆175Updated 2 months ago
- An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Informat…☆53Updated last year
- Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer☆60Updated 2 years ago
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆44Updated last year
- CTE: Contextualized Table Extraction Dataset☆17Updated 2 years ago
- Applied Deep Learning (2021 Spring) at National Taiwan University (NTU) CSIE☆9Updated 3 years ago
- Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)☆124Updated last year
- Code for ICCV 2023 Paper: “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”☆53Updated last year
- [NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning☆96Updated 5 months ago
- ☆206Updated 2 months ago
- https://dl.acm.org/doi/10.1145/3657281☆96Updated last year
- Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train …☆204Updated 2 weeks ago
- Code & Dataset for Paper: "Distill Visual Chart Reasoning Ability from LLMs to MLLMs"☆54Updated 8 months ago
- Example codebase for fine-tuning layoutLMv3 on DocVQA☆52Updated 2 years ago
- ☆41Updated last year
- ☆65Updated last year
- ☆75Updated 10 months ago
- A curated list of papers about key information extraction.☆96Updated 6 months ago
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.☆194Updated 3 months ago
- Official Implementation of TFLOP: Table Structure Recognition Framework with Layout Pointer Mechanism☆29Updated 3 weeks ago
- Datasets and Evaluation Scripts for CompHRDoc☆44Updated 4 months ago
- [EMNLP22] Improving Sharpness-Aware Minimization with Fisher Mask for Better Generalization on Language Models☆22Updated 2 years ago
- Welcome to the Table Meets LLM repository! Code for paper "Table meets LLM" and "Tab4LLM".☆37Updated 5 months ago
- The code related to the baselines from NeurIPS 2021 paper "DUE: End-to-End Document Understanding Benchmark."☆36Updated 2 years ago