pengyu965 / ChartDeteLinks
Context-Aware Chart Element Detection
☆49Updated last month
Alternatives and similar repositories for ChartDete
Users that are interested in ChartDete are comparing it to the libraries listed below
Sorting:
- Dataset introduced in PlotQA: Reasoning over Scientific Plots☆80Updated 2 years ago
- Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.☆133Updated last month
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)☆159Updated last year
- ☆67Updated last year
- Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023☆46Updated last year
- My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"☆72Updated 3 weeks ago
- ☆68Updated last year
- Line Chart Data Extraction: Official code for LineFormer - ICDAR23 Paper☆48Updated last year
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas…☆80Updated 2 years ago
- ☆82Updated last year
- The WordScape repository contains code for the WordScape pipeline to create datasets to train document understanding models.☆37Updated last year
- Dataset and scripts for HRDoc☆40Updated 2 years ago
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆53Updated 11 months ago
- ☆145Updated 2 years ago
- Datasets and Evaluation Scripts for CompHRDoc☆53Updated 8 months ago
- [ACL 2024] ChartAssistant is a chart-based vision-language model for universal chart comprehension and reasoning.☆131Updated last year
- ☆25Updated last year
- ☆45Updated last year
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)☆101Updated 7 months ago
- [NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning☆96Updated 10 months ago
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆18Updated 2 years ago
- TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning☆24Updated last year
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆43Updated last year
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆184Updated last year
- ☆87Updated last year
- [ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)☆42Updated 2 years ago
- DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems☆56Updated last year
- SciCap Dataset☆56Updated 4 years ago
- ☆227Updated 7 months ago
- (ICCV 2025) OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation☆94Updated 4 months ago