NormXU / nougat-latex-ocr
Codebase for fine-tuning / evaluating nougat-based image2latex generation models
☆124Updated last month
Related projects
Alternatives and complementary repositories for nougat-latex-ocr
- Formula recognition based on LaTeX-OCR and ONNXRuntime.☆304Updated 2 weeks ago
- TexTeller can convert image to latex formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability,…☆342Updated 3 months ago
- Large-scale training of a LaTeX formula recognition model; currently being organized for open-source release☆43Updated 7 months ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆104Updated 5 months ago
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆206Updated last month
- Object Detection Model for Scanned Documents☆82Updated last year
- A High-efficiency Open-source Toolkit for Table-to-Latex Task☆150Updated 2 weeks ago
- Data repository for LaTeX OCR☆97Updated 5 months ago
- A large scale camera-taken table detection and recognition dataset.☆112Updated last year
- A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating…☆136Updated 2 months ago
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆41Updated 7 months ago
- ☆67Updated this week
- ☆47Updated 4 months ago
- Official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆128Updated 5 months ago
- Mathematical Formula Detection (MFD) dataset for Chinese documents☆29Updated last year
- Another LaTeX equation OCR tool based on ConvNeXt and Transformer☆47Updated last year
- YOLO models trained on DocLayNet: power your document intelligence with layout analysis☆68Updated last month
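- Table recognition algorithm derived from PP-Structure; the model is converted to ONNX and runs on ONNXRuntime as the inference engine, so deployment is simple and free of memory leaks.☆70Updated last week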
- Research on accelerating practical deployment of the GOT-OCR project, not limited to any language☆51Updated 3 weeks ago
- doc2x docs☆30Updated this week
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files☆129Updated last year
- Table Structure Recognition☆62Updated last year
- Document orientation classification☆202Updated last week
- Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)☆121Updated last year
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis☆274Updated last year
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.☆164Updated this week
- Math OCR model that outputs LaTeX and markdown☆907Updated 3 weeks ago
- YOLOv10 trained on DocLayNet dataset.☆58Updated 2 weeks ago
- Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, and you can get the same (even better) result compared w…☆41Updated 4 months ago
- A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition☆91Updated 3 months ago