[ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"
☆259Apr 14, 2025Updated 10 months ago
Alternatives and similar repositories for OneChart
Users who are interested in OneChart are comparing it to the libraries listed below
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆195May 31, 2024Updated last year
- Vary-tiny codebase built upon LAVIS (for training from scratch) and a dataset of PDF image-text pairs (about 600k, including English and Chinese)☆86Sep 21, 2024Updated last year
- [ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.☆1,897Dec 30, 2024Updated last year
- Official code implementation of Slow Perception: Let's Perceive Geometric Figures Step-by-step☆159Jul 28, 2025Updated 7 months ago
- Official Repository of ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning☆250Sep 26, 2024Updated last year
- Dataset introduced in PlotQA: Reasoning over Scientific Plots☆84Jun 20, 2023Updated 2 years ago
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding☆2,370May 30, 2025Updated 9 months ago
- The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format.☆26Feb 22, 2024Updated 2 years ago
- On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)☆796Jul 5, 2025Updated 7 months ago
- ☆36Oct 7, 2023Updated 2 years ago
- A curated list of recent and past chart understanding work based on our IEEE TKDE survey paper: From Pixels to Insights: A Survey on Auto…☆232Dec 17, 2025Updated 2 months ago
- Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)☆630Dec 30, 2024Updated last year
- ☆102Dec 23, 2024Updated last year
- ☆57Jan 23, 2024Updated 2 years ago
- ☆256Dec 7, 2023Updated 2 years ago
- PDF data from Chinese-language papers, securities documents, and financial reports☆36Jun 13, 2024Updated last year
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,089Feb 10, 2025Updated last year
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,820Apr 9, 2025Updated 10 months ago
- 360LayoutAnalysis, a series of Document Analysis Models and Datasets developed by 360 AI Research Institute☆306Sep 10, 2024Updated last year
- Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train …☆225Jun 12, 2025Updated 8 months ago
- [IEEE VIS 2024] LLaVA-Chart: Advancing Multimodal Large Language Models in Chart Question Answering with Visualization-Referenced Instruc…☆75Jan 22, 2025Updated last year
- [NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning☆95Jan 7, 2025Updated last year
- ☆142Feb 13, 2024Updated 2 years ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆312Aug 15, 2025Updated 6 months ago
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆458Sep 28, 2025Updated 5 months ago
- ☆79May 6, 2024Updated last year
- Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models (CVPR 2024 Highlight)☆1,947Jan 24, 2026Updated last month
- A High-efficiency Open-source Toolkit for Table-to-Latex Task☆275Dec 6, 2025Updated 2 months ago
- ☆19Dec 6, 2023Updated 2 years ago
- [ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding☆95Apr 1, 2025Updated 11 months ago
- ✨✨Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models☆164Dec 26, 2024Updated last year
- Keypoint dataset for airplane☆10Dec 28, 2019Updated 6 years ago
- Python and JS tools to generate printed LaTeX formulas and images☆16Oct 26, 2023Updated 2 years ago
- [AAAI 2025] Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning☆43Apr 14, 2025Updated 10 months ago
- Generate table images by rendering them in a browser☆236Apr 10, 2024Updated last year
- To try textin document parsing, visit https://cc.co/16YSIy☆22Jul 9, 2024Updated last year
- Recursive Abstractive Processing for Tree-Organized Retrieval☆10May 30, 2024Updated last year
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations☆124Sep 28, 2025Updated 5 months ago
- Official repository of MMDU dataset☆104Sep 29, 2024Updated last year