Cvrane / ChartReader
Fully automated end-to-end framework for extracting data from bar plots and other figures in scientific research papers, using modules such as OpenCV and AWS Rekognition.
☆128Updated 4 years ago
Alternatives and similar repositories for ChartReader
Users who are interested in ChartReader are comparing it to the repositories listed below
- ☆146Updated 2 years ago
- Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks.☆134Updated 2 months ago
- My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"☆74Updated this week
- Object Detection Model for Scanned Documents☆93Updated 10 months ago
- Context-Aware Chart Element Detection☆50Updated 3 months ago
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas…☆79Updated 2 years ago
- Dataset and scripts for HRDoc☆41Updated 2 years ago
- ☆32Updated last year
- Official Implementation of TFLOP: Table Structure Recognition Framework with Layout Pointer Mechanism☆46Updated 4 months ago
- Chart-to-Text: Generating Natural Language Explanations for Charts by Adapting the Transformer Model☆158Updated 2 years ago
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆115Updated last year
- Example codebase for fine-tuning layoutLMv3 on DocVQA☆52Updated 3 years ago
- ☆56Updated 2 years ago
- ☆249Updated 2 years ago
- Datasets and Evaluation Scripts for CompHRDoc☆55Updated 10 months ago
- DocLLM: A layout-aware generative language model for multimodal document understanding☆133Updated 2 years ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆116Updated 2 years ago
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)☆159Updated last year
- Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023☆109Updated 2 years ago
- SciCap Dataset☆56Updated 4 years ago
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…☆286Updated 2 years ago
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆44Updated last year
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.☆77Updated 5 months ago
- YOLOv10 trained on DocLayNet dataset.☆80Updated last year
- ☆234Updated 8 months ago
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆179Updated 2 years ago
- ☆78Updated 2 years ago
- [ICCV 2023] ChartReader: A Unified Framework for Chart Derendering and Comprehension without Heuristic Rules☆28Updated last year
- TF-ID: Table/Figure IDentifier for academic papers☆245Updated last year
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis☆404Updated 2 years ago