yh-hust / PDF-WukongView external linksLinks
【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling
☆127Jun 4, 2025Updated 8 months ago
Alternatives and similar repositories for PDF-Wukong
Users that are interested in PDF-Wukong are comparing it to the libraries listed below
Sorting:
- 卡证和文档检测和矫正☆79Sep 18, 2024Updated last year
- [Paper] Code for the EMNLP2023 (Findings) paper "Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document"☆17Dec 1, 2023Updated 2 years ago
- convert paddleOCR to torchOCR, ppocr-v3,ppocr-v4, onnx, openvino☆33Aug 16, 2023Updated 2 years ago
- VisuRiddles: Fine-grained Perception is a important thing for Multimodal Large Models in Riddles Solving☆18Oct 22, 2025Updated 3 months ago
- Pytorch implements SA-Text: Simple but Accurate Detector for Text of Arbitrary Shapes☆42Jun 25, 2020Updated 5 years ago
- [ICLR 2026] OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning☆72Dec 17, 2025Updated last month
- ☆42Sep 2, 2023Updated 2 years ago
- Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models (CVPR 2024 Highlight)☆1,947Jan 24, 2026Updated 3 weeks ago
- ☆12Jul 8, 2021Updated 4 years ago
- ☆31Dec 18, 2025Updated last month
- ☆19Sep 11, 2024Updated last year
- Tools for ICDAR2019 competitions(fifth place)☆11May 6, 2019Updated 6 years ago
- ☆102Dec 23, 2024Updated last year
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆48Jun 13, 2024Updated last year
- [MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…☆20Dec 4, 2024Updated last year
- ocr data ,detect data ,recognize data☆29Mar 24, 2020Updated 5 years ago
- WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia☆32Jun 12, 2025Updated 8 months ago
- Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train …☆225Jun 12, 2025Updated 8 months ago
- Official implementation of PageNet (IJCV 2022)☆81Oct 31, 2022Updated 3 years ago
- Code for the paper "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021)☆280Dec 26, 2021Updated 4 years ago
- Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)☆200Jun 17, 2024Updated last year
- ICDAR 2024 Table OCR Model☆39Feb 4, 2026Updated last week
- end2end layout analysis based seq2seq☆132Mar 8, 2021Updated 4 years ago
- OpenOCR: An Open-Source Toolkit for General-OCR Research and Applications, integrates a unified training and evaluation benchmark, commer…☆1,181Feb 8, 2026Updated last week
- 面向大模型的民族文化数据集☆12May 26, 2025Updated 8 months ago
- 【CVPR 2025】SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting☆16Jul 1, 2025Updated 7 months ago
- ☆60Dec 10, 2025Updated 2 months ago
- A curated list of resources dedicated to table recognition☆406Dec 12, 2024Updated last year
- A PyTorch implementation of "ABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network" (CVPR 2020 oral)☆431Apr 28, 2022Updated 3 years ago
- ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting☆45Apr 11, 2025Updated 10 months ago
- CHIP2018问句匹配大赛 Rank6解决方案☆21Nov 27, 2018Updated 7 years ago
- 通过浏览器渲染生成表格图像☆236Apr 10, 2024Updated last year
- ☆13May 28, 2025Updated 8 months ago
- Graph Key Information Extraction: GKIE☆11Sep 15, 2022Updated 3 years ago
- [ICCV 2025] LIRA☆21Nov 25, 2025Updated 2 months ago
- [ECCV 2024] The official PyTorch implementation of the "Plain-Det: A Plain Multi-Dataset Object Detector".☆30Dec 8, 2024Updated last year
- Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining☆353Nov 29, 2023Updated 2 years ago
- A pytorch re-implementation of Convolutional recurrent network in pytorch☆40Jun 19, 2020Updated 5 years ago
- ☆78Aug 7, 2023Updated 2 years ago