zhiqic / ChartReader
[ICCV 2023] ChartReader: A Unified Framework for Chart Derendering and Comprehension without Heuristic Rules
☆22 Updated 11 months ago
Alternatives and similar repositories for ChartReader
Users who are interested in ChartReader are comparing it to the repositories listed below.
- Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023 ☆45 Updated 11 months ago
- ☆72 Updated 9 months ago
- ☆64 Updated last year
- ☆25 Updated 10 months ago
- ☆38 Updated 11 months ago
- A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo ☆34 Updated 9 months ago
- Dataset introduced in PlotQA: Reasoning over Scientific Plots ☆77 Updated last year
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023) ☆90 Updated last month
- Synthetic data generation pipelines for text-rich images. ☆67 Updated 2 months ago
- Dataset and scripts for HRDoc ☆37 Updated last year
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept… ☆81 Updated 2 years ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023 ☆24 Updated last year
- Democratization of "PaLI: A Jointly-Scaled Multilingual Language-Image Model" ☆90 Updated last year
- ☆138 Updated last year
- ☆32 Updated last year
- ☆51 Updated last year
- ☆15 Updated last month
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua… ☆56 Updated last month
- Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters". ☆57 Updated last month
- Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer ☆59 Updated 2 years ago
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation ☆72 Updated 8 months ago
- ☆26 Updated last year
- Fully automated end-to-end framework to extract data from bar plots and other figures in scientific research papers using modules such as… ☆111 Updated 3 years ago
- ECCV2024_Parrot Captions Teach CLIP to Spot Text ☆66 Updated 8 months ago
- Repository for the KVP10k dataset ☆17 Updated last week
- ACL'24 (Oral) Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback ☆64 Updated 8 months ago
- ☆22 Updated 9 months ago
- The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format. ☆24 Updated last year
- [Under Review] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with enla… ☆58 Updated 7 months ago
- Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023 ☆103 Updated last year