pengyu965 / ChartDete
Context-Aware Chart Element Detection
☆24Updated last year
Related projects: ⓘ
- Line Chart Data Extraction: Official code for LineFormer - ICDAR23 Paper☆23Updated 3 months ago
- [ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)☆36Updated 11 months ago
- ☆52Updated 8 months ago
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers☆21Updated 2 years ago
- Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023☆43Updated 3 months ago
- A Bottom-Up Instance Segmentation Strategy for segmenting document instances using Transformers☆52Updated last week
- Official repository accompaying the ICDAR 2023 paper☆10Updated 11 months ago
- ☆40Updated 2 years ago
- Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)☆116Updated 10 months ago
- Official implementation for Dessurt☆56Updated last year
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆28Updated last year
- OCR Annotations from Amazon Textract for Industry Documents Library☆99Updated 2 years ago
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆38Updated 5 months ago
- CTE: Contextualized Table Extraction Dataset☆17Updated last year
- Datasets and Evaluation Scripts for CompHRDoc☆19Updated 5 months ago
- ☆48Updated 3 months ago
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆17Updated last year
- Dataset and scripts for HRDoc☆30Updated last year
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation☆69Updated last week
- ☆129Updated last year
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files☆121Updated 10 months ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆23Updated last year
- Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021☆69Updated 3 years ago
- Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, cand you can get the same (even better) result compared w…☆35Updated 2 months ago
- My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"☆68Updated last week
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"☆33Updated 2 years ago
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)☆73Updated 11 months ago
- Dataset introduced in PlotQA: Reasoning over Scientific Plots☆66Updated last year
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆72Updated last year
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…☆33Updated last week