pengyu965 / ChartDeteLinks
Context-Aware Chart Element Detection
☆50Updated 4 months ago
Alternatives and similar repositories for ChartDete
Users that are interested in ChartDete are comparing it to the libraries listed below
Sorting:
- Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.☆136Updated 3 months ago
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)☆162Updated last year
- Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023☆46Updated last year
- My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"☆74Updated 2 weeks ago
- ☆148Updated 2 years ago
- Dataset introduced in PlotQA: Reasoning over Scientific Plots☆82Updated 2 years ago
- Dataset and scripts for HRDoc☆41Updated 2 years ago
- ☆67Updated 2 years ago
- [ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)☆42Updated 2 years ago
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.☆77Updated 6 months ago
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas…☆78Updated 2 years ago
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)☆103Updated 10 months ago
- Line Chart Data Extraction: Official code for LineFormer - ICDAR23 Paper☆54Updated 2 months ago
- ☆84Updated last year
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆53Updated last year
- ☆32Updated last year
- TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning☆23Updated last year
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆18Updated 3 years ago
- ☆87Updated 2 years ago
- The WordScape repository contains code for the WordScape pipeline to create datasets to train document understanding models.☆39Updated 2 years ago
- ☆71Updated last year
- Datasets and Evaluation Scripts for CompHRDoc☆55Updated 11 months ago
- [ACL 2024] ChartAssistant is a chart-based vision-language model for universal chart comprehension and reasoning.☆132Updated last year
- ☆45Updated last year
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆44Updated last year
- (WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, H…☆84Updated 5 months ago
- ☆45Updated 3 years ago
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation☆76Updated last year
- Render documents on a virtual paper with folds and other types of damage using blender geometry nodes.☆26Updated 2 years ago
- Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents, CVPR 2025☆25Updated last year