ACL 2025: Synthetic data generation pipelines for text-rich images.
☆156 · Mar 1, 2025 · Updated last year
Alternatives and similar repositories for pixmo-docs
Users who are interested in pixmo-docs are comparing it to the repositories listed below.
- [MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f… ☆20 · Dec 4, 2024 · Updated last year
- The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format. ☆26 · Feb 22, 2024 · Updated 2 years ago
- A dataset of scientific vector graphics in TikZ for training generative models. ☆25 · Feb 4, 2026 · Updated last month
- [MM 2025] CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models ☆54 · Oct 20, 2024 · Updated last year
- [EMNLP 2025] Distill Visual Chart Reasoning Ability from LLMs to MLLMs ☆59 · Aug 25, 2025 · Updated 6 months ago
- - ☆24 · Oct 25, 2022 · Updated 3 years ago
- Repo for the paper: Towards Few-shot Entity Recognition in Document Images: A Label-aware Sequence-to-Sequence Framework ☆14 · May 31, 2023 · Updated 2 years ago
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition ☆28 · Aug 29, 2023 · Updated 2 years ago
- [ICLR 2026] P2P: Automated Paper-to-Poster Generation and Fine-Grained Benchmark ☆49 · Jun 6, 2025 · Updated 9 months ago
- ☆15 · May 15, 2025 · Updated 10 months ago
- Data and code for ACL 2023 paper "RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations" ☆15 · Feb 8, 2024 · Updated 2 years ago
- [ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective ☆202 · Nov 1, 2023 · Updated 2 years ago
- The official repository of the dots.vlm1 instruct models proposed by rednote-hilab. ☆286 · Sep 26, 2025 · Updated 5 months ago
- Code for Paper: Harnessing Webpage UIs For Text Rich Visual Understanding