mitvis / vistext
VisText is a benchmark dataset for semantically rich chart captioning.
☆83Updated last year
Related projects ⓘ
Alternatives and complementary repositories for vistext
- ☆102Updated 3 months ago
- ☆10Updated last year
- ☆61Updated 2 months ago
- Vega-Lite Chart Dataset and NL Generation Framework using LLMs☆102Updated 5 months ago
- SciCap Dataset☆48Updated 3 years ago
- Dataset introduced in PlotQA: Reasoning over Scientific Plots☆69Updated last year
- Official implementation for <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>, accepted by ACL 2024. It a…☆35Updated last week
- ☆165Updated 3 months ago
- ☆11Updated last year
- Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers"☆40Updated last month
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"☆46Updated 3 weeks ago
- Code and data for the ACL 2024 Findings paper "Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning"☆23Updated 5 months ago
- PAIR.withgoogle.com and friend's work on interpretability methods☆148Updated last week
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆59Updated 6 months ago
- [ICPRAI 2024] DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents☆16Updated 7 months ago
- Chart-to-Text: Generating Natural Language Explanations for Charts by Adapting the Transformer Model☆149Updated last year
- CHI 2021 Paper Website☆10Updated 3 years ago
- Code accompanying "How I learned to start worrying about prompt formatting".☆92Updated last month
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.☆57Updated 5 months ago
- Evaluating the Moral Beliefs Encoded in LLMs☆21Updated 9 months ago
- ☆28Updated 4 years ago
- ☆128Updated last year
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆39Updated 4 months ago
- E5-V: Universal Embeddings with Multimodal Large Language Models☆167Updated 3 months ago
- Code for "Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models", ICLR 2024 Oral.☆20Updated 6 months ago
- [ACL 2024] ChartAssistant is a chart-based vision-language model for universal chart comprehension and reasoning.☆107Updated 2 months ago
- ☆83Updated last year
- ☆36Updated 3 months ago
- LLM Attributor: Attribute LLM's Generated Text to Training Data☆32Updated 4 months ago
- ☆36Updated 3 months ago