khuangaf / Awesome-Chart-Understanding
A curated list of recent and past chart understanding work based on our IEEE TKDE survey paper: From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models.
☆191Updated 2 weeks ago
Alternatives and similar repositories for Awesome-Chart-Understanding:
Users that are interested in Awesome-Chart-Understanding are comparing it to the libraries listed below
- [NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning☆95Updated 2 months ago
- Document Artifical Intelligence☆154Updated 3 months ago
- [NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs☆98Updated 2 months ago
- Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train …☆188Updated 5 months ago
- ☆69Updated 6 months ago
- ☆180Updated 8 months ago
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆226Updated 3 weeks ago
- ☆131Updated last year
- ☆221Updated last year
- [NeurIPS DB Track, 2024] MATH-Vision dataset and code to measure multimodal mathematical reasoning capabilities.☆90Updated this week
- Paper collections of multi-modal LLM for Math/STEM/Code.☆80Updated 3 weeks ago
- A Toolkit for Table-based Question Answering☆110Updated last year
- ☆262Updated 7 months ago
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations☆70Updated 7 months ago
- [ACL 2024] ChartAssistant is a chart-based vision-language model for universal chart comprehension and reasoning.☆113Updated 6 months ago
- MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning☆136Updated last year
- Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers" (ECCV 2024)☆133Updated 5 months ago
- ☆414Updated 5 months ago
- The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"☆144Updated last month
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆218Updated 3 weeks ago
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey`☆165Updated 3 months ago
- [Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token☆127Updated 8 months ago
- This is the official repository for Retrieval Augmented Visual Question Answering☆209Updated 2 months ago
- ☆109Updated 8 months ago
- Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.☆605Updated this week
- Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"☆472Updated last month
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆114Updated 4 months ago
- ☆253Updated last week
- Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models☆78Updated 8 months ago
- Code & Dataset for Paper: "Distill Visual Chart Reasoning Ability from LLMs to MLLMs"☆51Updated 4 months ago