MosRat / got.cppLinks

Using Llam.cpp and onnxruntime to accelerate inference of GOT-OCR2.0

☆15

Alternatives and similar repositories for got.cpp

Users that are interested in got.cpp are comparing it to the libraries listed below

Sorting:

1694439208 / GOT-OCR-Inference
研究GOT-OCR-项目落地加速，不限语言
☆62Updated last year
BaofengZan / GOT-OCRv2-onnx
用于学习GOT/Qwen/OnnxLLm
☆53Updated last year
Tencent / POINTS-Reader
☆194Updated 2 months ago
ppaanngggg / yolo-doclaynet
YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis
☆147Updated 6 months ago
Veason-silverbullet / ViTLP
[NAACL 2024] Visually Guided Generative Text-Layout Pre-training for Document Intelligence
☆149Updated last year
SWHL / ChineseDocumentPDF
中文论文、证券类、财报类PDF数据
☆36Updated last year
ucaslcl / Fox
official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"
☆195Updated last year
ppaanngggg / layoutreader
A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.
☆302Updated 5 months ago
kyegomez / Kosmos2.5
My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"
☆74Updated 2 weeks ago
PRITHIVSAKTHIUR / OCR-ReportLab-Notebooks
A dedicated Colab notebooks to experiment (Nanonets OCR, Monkey OCR, OCRFlux 3B, Typhoo OCR 3B & more..) On T4 GPU - free tier
☆23Updated last month
LynnHaDo / Document-Layout-Analysis
Object Detection Model for Scanned Documents
☆94Updated 11 months ago
lucasjinreal / Namo-R1
A CPU Realtime VLM in 500M. Surpassed Moondream2 and SmolVLM. Training from scratch with ease.
☆248Updated 9 months ago
NormXU / nougat-latex-ocr
Codebase for fine-tuning / evaluating nougat-based image2latex generation models
☆159Updated last year
liunian-Jay / MU-GOT
PDF Parsing Tool: GOT's vLLM acceleration implementation, MinerU for layout recognition, and GOT for table formula parsing.
☆65Updated last year
raphael-baena / DTLR
Handwritten Text Recognition and Character Detection
☆164Updated 4 months ago
yujunhuics / LayoutReader
阅读顺序、Layoutreader
☆19Updated 8 months ago
LayTextLLM / LayTextLLM
☆102Updated last year
NormXU / DocParser-Pytorch
An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents
☆37Updated 2 years ago
InternScience / StructEqTable-Deploy
A High-efficiency Open-source Toolkit for Table-to-Latex Task
☆274Updated 2 months ago
Ucas-HaoranWei / Vary-family
☆57Updated 2 years ago
SWHL / TrOCR-Formula-Rec
基于TrOCR + UniMER-1M数据集，训练一个小而美的公式识别模型
☆29Updated 7 months ago
Ucas-HaoranWei / Vary-tiny-600k
Vary-tiny codebase upon LAVIS （for training from scratch）and a PDF image-text pairs data (about 600k including English/Chinese)
☆86Updated last year
ZeningLin / PEneo
[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.
☆40Updated 10 months ago
puhuilab / phocr
an open high-performance Optical Character Recognition (OCR) toolkit
☆306Updated 6 months ago
ai8hyf / TF-ID
TF-ID: Table/Figure IDentifier for academic papers
☆245Updated last year
whlscut / DocLayLLM
[CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding
☆25Updated last month
LingyvKong / OneChart
[ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"
☆259Updated 9 months ago
poloclub / tsr-convstem
High-Performance Transformers for Table Structure Recognition Need Early Convolutions
☆44Updated last year
CycloneBoy / pdf_table
A Unified Toolkit for Deep Learning-Based Table Extraction
☆58Updated last year
opendatalab / UniMERNet
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
☆453Updated 4 months ago