[ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"
☆261 · Apr 14, 2025 · Updated 11 months ago
Alternatives and similar repositories for OneChart
Users that are interested in OneChart are comparing it to the libraries listed below.
- Official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding" ☆194 · May 31, 2024 · Updated last year
- Vary-tiny codebase built on LAVIS (for training from scratch) and a dataset of PDF image-text pairs (about 600k, including English and Chinese) ☆86 · Sep 21, 2024 · Updated last year
- [ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models. ☆1,897 · Dec 30, 2024 · Updated last year
- Official code implementation of Slow Perception: Let's Perceive Geometric Figures Step-by-step ☆159 · Jul 28, 2025 · Updated 7 months ago
- Keypoint dataset for airplane ☆10 · Dec 28, 2019 · Updated 6 years ago
- Official Repository of ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning ☆252 · Sep 26, 2024 · Updated last year
- Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary) ☆630 · Dec 30, 2024 · Updated last year
- ☆57 · Jan 23, 2024 · Updated 2 years ago
- The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format. ☆26 · Feb 22, 2024 · Updated 2 years ago
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding ☆2,375 · May 30, 2025 · Updated 9 months ago
- ☆36 · Oct 7, 2023 · Updated 2 years ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model ☆8,104 · Feb 10, 2025 · Updated last year
- On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench) ☆802 · Jul 5, 2025 · Updated 8 months ago
- ☆256 · Dec 7, 2023 · Updated 2 years ago
- [IEEE VIS 2024] LLaVA-Chart: Advancing Multimodal Large Language Models in Chart Question Answering with Visualization-Referenced Instruc… ☆75 · Jan 22, 2025 · Updated last year
- A curated list of recent and past chart understanding work based on our IEEE TKDE survey paper: From Pixels to Insights: A Survey on Auto… ☆233 · Dec 17, 2025 · Updated 3 months ago
- Dataset introduced in PlotQA: Reasoning over Scientific Plots ☆84 · Jun 20, 2023 · Updated 2 years ago
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team… ☆1,825 · Updated this week
- 360LayoutAnalysis, a series of document analysis models and datasets developed by 360 AI Research Institute ☆307 · Sep 10, 2024 · Updated last year
- Research on accelerating production deployment of the GOT-OCR project, not limited to any language ☆61 · Oct 24, 2024 · Updated last year
- Accelerating GOT-OCRv2 with VLLM ☆11 · Nov 15, 2024 · Updated last year
- PDF data for Chinese papers, securities documents, and financial reports ☆39 · Jun 13, 2024 · Updated last year
- ☆143 · Feb 13, 2024 · Updated 2 years ago
- Generates table images via browser rendering ☆236 · Apr 10, 2024 · Updated last year
- ☆102 · Dec 23, 2024 · Updated last year
- ☆19 · Dec 6, 2023 · Updated 2 years ago
- CDLA: A Chinese document layout analysis (CDLA) dataset ☆289 · Sep 13, 2021 · Updated 4 years ago
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition ☆459 · Sep 28, 2025 · Updated 5 months ago
- A faster LayoutReader model based on LayoutLMv3 that sorts OCR bounding boxes into reading order. ☆317 · Aug 15, 2025 · Updated 7 months ago
- Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models (CVPR 2024 Highlight) ☆1,950 · Jan 24, 2026 · Updated last month
- [ACL 2024] ChartAssistant is a chart-based vision-language model for universal chart comprehension and reasoning. ☆132 · Sep 7, 2024 · Updated last year
- MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios. ☆24 · Dec 11, 2024 · Updated last year
- [ICLR 2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding ☆95 · Apr 1, 2025 · Updated 11 months ago
- A High-efficiency Open-source Toolkit for Table-to-Latex Task ☆276 · Dec 6, 2025 · Updated 3 months ago
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI 2024) ☆162 · May 31, 2024 · Updated last year
- ☆79 · May 6, 2024 · Updated last year
- Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train …