zhiqic / ChartReaderLinks
[ICCV 2023] ChartReader: A Unified Framework for Chart Derendering and Comprehension without Heuristic Rules
☆22Updated last year
Alternatives and similar repositories for ChartReader
Users that are interested in ChartReader are comparing it to the libraries listed below
Sorting:
- A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo☆34Updated 9 months ago
- Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023☆46Updated 11 months ago
- ☆51Updated last year
- ☆32Updated last year
- ☆25Updated 11 months ago
- ☆64Updated last year
- ☆65Updated last year
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…☆59Updated 3 weeks ago
- Dataset introduced in PlotQA: Reasoning over Scientific Plots☆77Updated last year
- ☆41Updated last year
- Matryoshka Multimodal Models☆108Updated 4 months ago
- ☆22Updated 10 months ago
- ☆73Updated 9 months ago
- ☆17Updated last month
- [ACL'25 Main] ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation☆40Updated last week
- The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format.☆25Updated last year
- Democratization of "PaLI: A Jointly-Scaled Multilingual Language-Image Model"☆90Updated last year
- The codebase for our EMNLP24 paper: Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Mo…☆78Updated 4 months ago
- Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced …☆77Updated 6 months ago
- ACL 2025: Synthetic data generation pipelines for text-rich images.☆73Updated 3 months ago
- Dataset and scripts for HRDoc☆38Updated last year
- This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive". https://arxiv.or…☆125Updated 11 months ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆25Updated last year
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations☆82Updated 10 months ago
- [ACL 2024] ChartAssistant is a chart-based vision-language model for universal chart comprehension and reasoning.☆118Updated 9 months ago
- Official code and dataset for our EMNLP 2024 Findings paper: Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Kn…☆19Updated 5 months ago
- ☆133Updated last year
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models☆62Updated 7 months ago
- ECCV2024_Parrot Captions Teach CLIP to Spot Text☆66Updated 9 months ago
- ☆65Updated 10 months ago