MosRat / got.cppLinks
Using Llam.cpp and onnxruntime to accelerate inference of GOT-OCR2.0
☆15Updated 11 months ago
Alternatives and similar repositories for got.cpp
Users that are interested in got.cpp are comparing it to the libraries listed below
Sorting:
- 研究GOT-OCR-项目落地加速,不限语言☆62Updated last year
- 用于学习GOT/Qwen/OnnxLLm☆53Updated last year
- ☆194Updated 2 months ago
- YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis☆147Updated 6 months ago
- [NAACL 2024] Visually Guided Generative Text-Layout Pre-training for Document Intelligence☆149Updated last year
- 中文论文、证券类、财报类PDF数据☆36Updated last year
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆195Updated last year
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆302Updated 5 months ago
- My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"☆74Updated 2 weeks ago
- A dedicated Colab notebooks to experiment (Nanonets OCR, Monkey OCR, OCRFlux 3B, Typhoo OCR 3B & more..) On T4 GPU - free tier☆23Updated last month
- Object Detection Model for Scanned Documents☆94Updated 11 months ago
- A CPU Realtime VLM in 500M. Surpassed Moondream2 and SmolVLM. Training from scratch with ease.☆248Updated 9 months ago
- Codebase for fine-tuning / evaluating nougat-based image2latex generation models☆159Updated last year
- PDF Parsing Tool: GOT's vLLM acceleration implementation, MinerU for layout recognition, and GOT for table formula parsing.☆65Updated last year
- Handwritten Text Recognition and Character Detection☆164Updated 4 months ago
- 阅读顺序、Layoutreader☆19Updated 8 months ago
- ☆102Updated last year
- An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents☆37Updated 2 years ago
- A High-efficiency Open-source Toolkit for Table-to-Latex Task☆274Updated 2 months ago
- ☆57Updated 2 years ago
- 基于TrOCR + UniMER-1M数据集,训练一个小而美的公式识别模型☆29Updated 7 months ago
- Vary-tiny codebase upon LAVIS (for training from scratch)and a PDF image-text pairs data (about 600k including English/Chinese)☆86Updated last year
- [MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.☆40Updated 10 months ago
- an open high-performance Optical Character Recognition (OCR) toolkit☆306Updated 6 months ago
- TF-ID: Table/Figure IDentifier for academic papers☆245Updated last year
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding☆25Updated last month
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆259Updated 9 months ago
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆44Updated last year
- A Unified Toolkit for Deep Learning-Based Table Extraction☆58Updated last year
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆453Updated 4 months ago