SWHL / TrOCR-Formula-Rec
Training a small but elegant formula recognition model based on TrOCR and the UniMER-1M dataset
☆19Updated 3 months ago
Alternatives and similar repositories for TrOCR-Formula-Rec:
Users who are interested in TrOCR-Formula-Rec are comparing it to the libraries listed below
- ☆165Updated 11 months ago
- Detection and rectification of ID cards and documents☆43Updated 5 months ago
- ☆77Updated last month
- ☆41Updated last year
- ☆53Updated 7 months ago
- DocTr++ in PaddlePaddle☆43Updated 6 months ago
- [MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.☆28Updated 2 months ago
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition☆27Updated last year
- The official PyTorch implementation of SEMv3.☆34Updated 8 months ago
- This repo is used to release the ArxivFormula dataset.☆24Updated 3 months ago
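- Document image processing tool, including bleaching / text orientation correction / sharpness enhancement / note denoising and beautification / shadow removal / dewarping / edge-trim enhancement (DocBleach / TextOrientationCorrection / DocSha…☆36Updated 5 months ago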
- ☆115Updated last year
- ICDAR 2024 Table OCR Model☆28Updated 2 months ago
- Generate table images by rendering them in a browser☆211Updated 10 months ago
- 🔥Character (single-character) detection based on CRNN☆77Updated last year
- The official implementation of SPTS v2: Single-Point Text Spotting☆131Updated last year
- WikiTableSet: The largest publicly available image-based table recognition dataset in three languages, built from Wikipedia☆27Updated last year
- The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer☆49Updated 8 months ago
- Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)☆122Updated last year
- Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)☆46Updated 8 months ago
- Research on accelerating GOT-OCR for production deployment, in any language☆58Updated 3 months ago
- Vary-tiny codebase built on LAVIS (for training from scratch) and a PDF image-text pair dataset (about 600k, including English and Chinese)☆77Updated 5 months ago
- Convert PaddleOCR to torchOCR: ppocr-v3, ppocr-v4, onnx, openvino☆32Updated last year
- OCR document image deformation correction. A reproduction of Alibaba's OCR deformation correction for crumpled document images☆93Updated 4 years ago
- ☆33Updated last year
- For learning GOT/Qwen/OnnxLLm☆47Updated 4 months ago
- [TAI 2023] Appearance Enhancement for Camera-captured Document Images in the Wild☆31Updated last year
- CDLA: A Chinese document layout analysis (CDLA) dataset☆258Updated 3 years ago
- vLLM-accelerated implementation of GOT, combined with MinerU for PDF parsing in RAG☆46Updated 3 months ago
- This repository is the implementation of "Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Contex…☆85Updated last year