pranonrahman / ChartSummLinks
ChartSum is a large scale benchmark for automatic chart to text summarization
☆11Updated 2 years ago
Alternatives and similar repositories for ChartSumm
Users that are interested in ChartSumm are comparing it to the libraries listed below
Sorting:
- ☆80Updated 11 months ago
- ☆115Updated last year
- ☆215Updated 3 months ago
- [NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning☆98Updated 7 months ago
- ☆46Updated 4 months ago
- Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models☆90Updated last year
- ☆59Updated last year
- ☆78Updated last year
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆80Updated 9 months ago
- ☆25Updated 2 weeks ago
- ☆14Updated last year
- MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning☆135Updated 2 years ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆165Updated last year
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi…☆111Updated last month
- NeurIPS 2024: SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation☆13Updated 2 months ago
- Paper collections of multi-modal LLM for Math/STEM/Code.☆118Updated 3 weeks ago
- The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''☆218Updated last year
- [ICLR 2025] ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation☆117Updated last month
- [AAAI 2025]Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning☆37Updated 4 months ago
- M-HalDetect Dataset Release☆25Updated last year
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations☆90Updated last year
- The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format.☆26Updated last year
- [NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs☆124Updated 3 months ago
- A RLHF Infrastructure for Vision-Language Models☆180Updated 9 months ago
- ☆100Updated last year
- A Self-Training Framework for Vision-Language Reasoning☆80Updated 6 months ago
- ☆17Updated last year
- The official dataset of the flowvqa project.☆16Updated last year
- ☆48Updated 2 months ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆74Updated last year