mingliangzhang2018 / PGDPLinks

The first end-to-end deep learning model for explicit plane geometry diagram parsing.

☆50

Alternatives and similar repositories for PGDP

Users that are interested in PGDP are comparing it to the libraries listed below

Sorting:

mingliangzhang2018 / PGPS
The implement of geometric solver PGPSNet
☆27Updated 6 months ago
zezeze97 / DFE-GPS
☆12Updated 3 weeks ago
Ucas-HaoranWei / Vary-tiny-600k
Vary-tiny codebase upon LAVIS （for training from scratch）and a PDF image-text pairs data (about 600k including English/Chinese)
☆85Updated 10 months ago
SCUT-DLVCLab / GPT-4V_OCR
Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)
☆125Updated last year
Ucas-HaoranWei / Slow-Perception
Official code implementation of Slow Perception:Let's Perceive Geometric Figures Step-by-step
☆131Updated last week
LukeForeverYoung / UReader
☆137Updated last year
lupantech / InterGPS
Data and Code for ACL 2021 Paper "Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning"
☆156Updated 4 months ago
Token-family / TokenFD
[ICCV2025] A Token-level Text Image Foundation Model for Document Understanding
☆111Updated last week
sakura2233565548 / TabPedia
This repository is the codebase of TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy
☆42Updated 9 months ago
HCIILAB / M6Doc
☆143Updated 3 months ago
bytedance / E2STR
The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer
☆53Updated last year
GeoEval / GeoEval
This is the Repository for Geometry Problem Solving Method Evaluation
☆24Updated 10 months ago
bytedance / TextHarmony
The official code for NeurIPS 2024 paper: Harmonizing Visual Text Comprehension and Generation
☆129Updated 8 months ago
mayubo2333 / MMLongBench-Doc
Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations
☆90Updated last year
Green-Wood / CoMER
Official implementation for ECCV 2022 paper "CoMER: Modeling Coverage for Transformer-based Handwritten Mathematical Expression Recogniti…
☆125Updated 2 years ago
harrytea / Awesome-Document-Understanding
Document Artifical Intelligence
☆184Updated 3 months ago
LingyvKong / OneChart
[ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"
☆227Updated 3 months ago
guoxy25 / Ocean-OCR
☆37Updated 6 months ago
ycpNotFound / GeoGen
A pipeline for the automatic construction of geometry problems along with step-by-step solutions.
☆13Updated last month
MAEHCM / ICL-D3IE
Code for ICCV 2023 Paper : “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”
☆53Updated 2 years ago
OpenGVLab / ChartAst
[ACL 2024] ChartAssistant is a chart-based vision-language model for universal chart comprehension and reasoning.
☆123Updated 11 months ago
yuyq96 / TextHawk
Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models
☆62Updated 9 months ago
large-ocr-model / large-ocr-model.github.io
☆181Updated last year
yh-hust / PDF-Wukong
【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling
☆122Updated 2 months ago
microsoft / ArxivFormula
This repo is used to release the ArxivFormula dataset.
☆31Updated 8 months ago
SpursGoZmy / Table-LLaVA
Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train …
☆213Updated last month
LayTextLLM / LayTextLLM
☆96Updated 7 months ago
bytedance / MTVQA
MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…
☆62Updated 2 months ago
HCIILAB / LAST
Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition
☆28Updated last year
ucaslcl / Fox
official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"
☆153Updated last year