zengxingchen / ChartQA-MLLM
[IEEE VIS 2024] LLaVA-Chart: Advancing Multimodal Large Language Models in Chart Question Answering with Visualization-Referenced Instruction Tuning
☆50 Updated last month
Related projects
Alternatives and complementary repositories for ChartQA-MLLM
- Code & Dataset for Paper: "Distill Visual Chart Reasoning Ability from LLMs to MLLMs" ☆30 Updated 3 weeks ago
- Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning ☆32 Updated last month
- The first dense retrieval model that can be prompted like an LM ☆63 Updated 2 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆75 Updated last month
- Code for Paper: Harnessing Webpage UIs for Text-Rich Visual Understanding ☆38 Updated last month
- Codes for Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models ☆124 Updated 3 weeks ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models ☆103 Updated 6 months ago
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM ☆56 Updated 5 months ago
- [NeurIPS 2024] A task generation and model evaluation system for multimodal language models. ☆56 Updated last month
- The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks" ☆50 Updated 7 months ago
- HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models ☆30 Updated this week
- Official Repo for UGround ☆97 Updated 2 weeks ago
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective ☆40 Updated 3 weeks ago
- SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights ☆35 Updated last month
- LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture ☆179 Updated last month
- MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment ☆31 Updated 4 months ago
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?" ☆47 Updated last month
- Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers" ☆40 Updated last month
- E5-V: Universal Embeddings with Multimodal Large Language Models ☆173 Updated 4 months ago
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A… ☆40 Updated 9 months ago
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents ☆46 Updated 2 weeks ago
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps" ☆109 Updated 3 months ago
- The Official Code Repository for GUI-World. ☆41 Updated 3 months ago
- [EMNLP 2024] Official PyTorch implementation code for realizing the technical part of Traversal of Layers (TroL) presenting new propagati… ☆88 Updated 4 months ago
- A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents ☆31 Updated this week
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks ☆25 Updated last month