[ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"
☆266Apr 14, 2025Updated last year
Alternatives and similar repositories for OneChart
Users that are interested in OneChart are comparing it to the libraries listed below.
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆196May 31, 2024Updated 2 years ago
- Vary-tiny codebase built on LAVIS (for training from scratch) and a PDF image-text pair dataset (about 600k pairs, English and Chinese)☆89Sep 21, 2024Updated last year
- [ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.☆1,889Dec 30, 2024Updated last year
- Official code implementation of Slow Perception: Let's Perceive Geometric Figures Step-by-step☆161Jul 28, 2025Updated 11 months ago
- Keypoint dataset for airplane☆10Dec 28, 2019Updated 6 years ago
- Official Repository of ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning☆259Sep 26, 2024Updated last year
- Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)☆629Dec 30, 2024Updated last year
- ☆57Jan 23, 2024Updated 2 years ago
- The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format.☆26Feb 22, 2024Updated 2 years ago
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding☆2,408May 30, 2025Updated last year
- ☆39Oct 7, 2023Updated 2 years ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,146Feb 10, 2025Updated last year
- On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)☆854Jun 12, 2026Updated 2 weeks ago
- ☆257Dec 7, 2023Updated 2 years ago
- [IEEE VIS 2024] LLaVA-Chart: Advancing Multimodal Large Language Models in Chart Question Answering with Visualization-Referenced Instruc…☆75Jan 22, 2025Updated last year
- A curated list of recent and past chart understanding work based on our IEEE TKDE survey paper: From Pixels to Insights: A Survey on Auto…☆241Dec 17, 2025Updated 6 months ago
- Dataset introduced in PlotQA: Reasoning over Scientific Plots☆83Jun 20, 2023Updated 3 years ago
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,831Mar 17, 2026Updated 3 months ago
- 360LayoutAnalysis, a series of document analysis models and datasets developed by 360 AI Research Institute☆305Sep 10, 2024Updated last year
- Research on accelerating GOT-OCR for real-world project deployment, in any language☆62Oct 24, 2024Updated last year
- Accelerating GOT-OCRv2 with VLLM☆10Nov 15, 2024Updated last year
- PDF data from Chinese academic papers, securities documents, and financial reports☆41Jun 13, 2024Updated 2 years ago
- ☆143Feb 13, 2024Updated 2 years ago
- ☆102Dec 23, 2024Updated last year
- Generates table images through browser rendering☆238Apr 10, 2024Updated 2 years ago
- ☆19Dec 6, 2023Updated 2 years ago
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆487Sep 28, 2025Updated 9 months ago
- CDLA: A Chinese document layout analysis (CDLA) dataset☆293Sep 13, 2021Updated 4 years ago
- Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models (CVPR 2024 Highlight)☆1,948Jun 2, 2026Updated last month
- A faster LayoutReader model based on LayoutLMv3 that sorts OCR bounding boxes into reading order.☆321Aug 15, 2025Updated 10 months ago
- [ACL 2024] ChartAssistant is a chart-based vision-language model for universal chart comprehension and reasoning.☆135Sep 7, 2024Updated last year
- MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.☆24Dec 11, 2024Updated last year
- [ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding☆100Apr 1, 2025Updated last year
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)☆162May 31, 2024Updated 2 years ago
- A High-efficiency Open-source Toolkit for Table-to-Latex Task☆277Dec 6, 2025Updated 6 months ago
- Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train …☆227Jun 12, 2025Updated last year
- [NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning☆95Jan 7, 2025Updated last year
- Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model☆281Jun 25, 2024Updated 2 years ago
- ☆48Feb 7, 2025Updated last year