zhiqic / ChartReaderLinks

[ICCV 2023] ChartReader: A Unified Framework for Chart Derendering and Comprehension without Heuristic Rules

☆26

Alternatives and similar repositories for ChartReader

Users that are interested in ChartReader are comparing it to the libraries listed below

Sorting:

naver-ai / cream
Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023
☆46Updated last year
rubenpt91 / MP-DocVQA-Framework
☆67Updated last year
vis-nlp / UniChart
☆82Updated last year
naver-ai / tablevqabench
☆45Updated last year
DS3Lab / WordScape
The WordScape repository contains code for the WordScape pipeline to create datasets to train document understanding models.
☆37Updated last year
soap117 / DeepRule
☆145Updated 2 years ago
dali92002 / SSL-OCR
Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023
☆28Updated 2 years ago
andreagemelli / doc2graph
Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.
☆133Updated last month
nttmdlab-nlp / InstructDoc
InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)
☆159Updated last year
guoxy25 / Ocean-OCR
☆43Updated 9 months ago
emanuelevivoli / awesome-comics-understanding
The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding"
☆129Updated 10 months ago
vis-nlp / ChartQA
☆227Updated 7 months ago
kongds / E5-V
E5-V: Universal Embeddings with Multimodal Large Language Models
☆274Updated 11 months ago
Victorwz / MLM_Filter
Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters".
☆68Updated 7 months ago
FuxiaoLiu / MMC
[NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning
☆96Updated 10 months ago
NiteshMethani / PlotQA
Dataset introduced in PlotQA: Reasoning over Scientific Plots
☆80Updated 2 years ago
ParadoxZW / LLaVA-UHD-Better
A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo
☆34Updated last year
jfma-USTC / HRDoc
Dataset and scripts for HRDoc
☆40Updated 2 years ago
thunlp / ChartCoder
[ACL'25 Main] ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation
☆65Updated last week
SALT-NLP / LLaVAR
Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding"
☆269Updated last year
furkanbiten / idl_data
OCR Annotations from Amazon Textract for Industry Documents Library
☆103Updated 3 years ago
kyegomez / PALI
Democratization of "PaLI: A Jointly-Scaled Multilingual Language-Image Model"
☆91Updated last year
ByungKwanLee / Meteor
[NeurIPS 2024] Official PyTorch implementation code for realizing the technical part of Mamba-based traversal of rationale (Meteor) to im…
☆116Updated last year
IBM / KVP10k
Repository for the KVP10k dataset
☆21Updated 2 months ago
ayanban011 / SwinDocSegmenter
[ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation
☆76Updated last year
johnning2333 / M2Doc
☆39Updated last year
amazon-science / QA-ViT
☆69Updated last year
OpenGVLab / ChartAst
[ACL 2024] ChartAssistant is a chart-based vision-language model for universal chart comprehension and reasoning.
☆131Updated last year
LukeForeverYoung / UReader
☆142Updated last year
opendatalab / CLIP-Parrot-Bias
ECCV2024_Parrot Captions Teach CLIP to Spot Text
☆65Updated last year