The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format.
☆26Feb 22, 2024Updated 2 years ago
Alternatives and similar repositories for SimChart9K
Users that are interested in SimChart9K are comparing it to the libraries listed below
Sorting:
- Official repository for "TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving"☆23Sep 1, 2025Updated 6 months ago
- [CVPR'24] Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression☆15Jul 1, 2024Updated last year
- ☆85Aug 18, 2024Updated last year
- ☆15May 15, 2025Updated 10 months ago
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆16Nov 28, 2024Updated last year
- Official Repository of ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning☆252Sep 26, 2024Updated last year
- Dataset introduced in PlotQA: Reasoning over Scientific Plots☆84Jun 20, 2023Updated 2 years ago
- ☆27Jul 6, 2024Updated last year
- A huge dataset for Document Visual Question Answering☆20Jul 29, 2024Updated last year
- Joint Distribution Alignment via Adversarial Learning for Domain Adaptive Object Detection☆11Mar 10, 2022Updated 4 years ago
- [NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy☆73Jan 22, 2025Updated last year
- VisuRiddles: Fine-grained Perception is a important thing for Multimodal Large Models in Riddles Solving☆18Oct 22, 2025Updated 4 months ago
- DVQA Dataset: A Bar chart question answering dataset presented at CVPR 2018☆38Jun 24, 2019Updated 6 years ago
- Increasing the scale and diversity of chart de-rendering data.☆12Mar 13, 2024Updated 2 years ago
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.☆19Jun 27, 2024Updated last year
- Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023☆46Jun 11, 2024Updated last year
- [ACL 2024] ChartAssistant is a chart-based vision-language model for universal chart comprehension and reasoning.☆132Sep 7, 2024Updated last year
- MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts☆355Sep 29, 2025Updated 5 months ago
- ☆134Dec 22, 2023Updated 2 years ago
- OpenMMLab Detection Toolbox and Benchmark for V3Det☆15Apr 3, 2024Updated last year
- ☆73Jul 14, 2024Updated last year
- VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition☆69Oct 1, 2022Updated 3 years ago
- [NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning☆95Jan 7, 2025Updated last year
- ☆256Dec 7, 2023Updated 2 years ago
- Curriculum-style Local-to-global Adaptation for Cross-domain Remote Sensing Image Segmentation☆44Sep 11, 2023Updated 2 years ago
- ☆244Apr 18, 2025Updated 11 months ago
- Chart-to-Text: Generating Natural Language Explanations for Charts by Adapting the Transformer Model☆159Jul 23, 2023Updated 2 years ago
- ☆23Aug 17, 2024Updated last year
- [ICCV 2023] ChartReader: A Unified Framework for Chart Derendering and Comprehension without Heuristic Rules☆28Jun 3, 2024Updated last year
- ☆125Jul 14, 2024Updated last year
- [Paper] Code for the EMNLP2023 (Findings) paper "Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document"☆17Dec 1, 2023Updated 2 years ago
- Code and Data for ManyModalQA: Modality Disambiguation and QA over Diverse Inputs☆17Mar 2, 2020Updated 6 years ago
- DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models☆152Jan 13, 2025Updated last year
- v1: Learning to Point Visual Tokens for Multimodal Grounded Reasoning☆19Oct 6, 2025Updated 5 months ago
- Official repository of MMDU dataset☆104Sep 29, 2024Updated last year
- [T-PAMI'23] PAGCP for the compression of YOLOv5☆121Apr 13, 2023Updated 2 years ago
- ImageNet-12k subset of ImageNet-21k (fall11)☆21Jun 13, 2023Updated 2 years ago
- ☆14Jun 10, 2025Updated 9 months ago
- [CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958)☆44Sep 5, 2023Updated 2 years ago