pranonrahman / ChartSumm
ChartSum is a large scale benchmark for automatic chart to text summarization
☆11Updated last year
Alternatives and similar repositories for ChartSumm:
Users that are interested in ChartSumm are comparing it to the libraries listed below
- ☆109Updated 6 months ago
- ☆66Updated 5 months ago
- ☆175Updated 6 months ago
- MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning☆134Updated last year
- [NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning☆89Updated 3 weeks ago
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆59Updated 2 months ago
- Dataset introduced in PlotQA: Reasoning over Scientific Plots☆72Updated last year
- A Self-Training Framework for Vision-Language Reasoning☆61Updated last week
- ☆16Updated last year
- The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''☆195Updated 10 months ago
- The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format.☆21Updated 11 months ago
- ☆59Updated 7 months ago
- The official dataset of the flowvqa project.☆11Updated 10 months ago
- Paper collections of multi-modal LLM for Math/STEM/Code.☆63Updated this week
- Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models☆76Updated 7 months ago
- ☆11Updated last year
- MATH-Vision dataset and code to measure Multimodal Mathematical Reasoning capabilities.☆81Updated 3 months ago
- M-HalDetect Dataset Release☆20Updated last year
- MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU☆44Updated last year
- ☆38Updated last year
- [ACL 2024] This is the code repo for our ACL‘24 paper "MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Module …☆36Updated 7 months ago
- ☆29Updated last year
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi…☆97Updated 3 months ago
- [ICML 2024] Selecting High-Quality Data for Training Language Models☆156Updated 7 months ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆64Updated 9 months ago
- ☆10Updated 7 months ago
- mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating☆88Updated last year
- 😎 curated list of awesome LMM hallucinations papers, methods & resources.☆147Updated 10 months ago
- ☆95Updated last year
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆161Updated 11 months ago