PRITHIVSAKTHIUR / OCR-ReportLab-NotebooksLinks
A dedicated Colab notebooks to experiment (Nanonets OCR, Monkey OCR, OCRFlux 3B, Typhoo OCR 3B & more..) On T4 GPU - free tier
☆22Updated 2 months ago
Alternatives and similar repositories for OCR-ReportLab-Notebooks
Users that are interested in OCR-ReportLab-Notebooks are comparing it to the libraries listed below
Sorting:
- ☆167Updated 3 weeks ago
- 研究GOT-OCR-项目落地加速,不限语言☆62Updated 11 months ago
- SPRINT: Script-agnostic Structure Recognition in Tables☆13Updated 6 months ago
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆38Updated last year
- An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents☆37Updated 2 years ago
- 阅读顺序、Layoutreader☆19Updated 5 months ago
- ☆98Updated 9 months ago
- ICDAR 2024 Table OCR Model☆38Updated 2 months ago
- ☆27Updated 11 months ago
- [MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.☆37Updated 6 months ago
- GLM Series Edge Models☆149Updated 3 months ago
- A handwritten Chemical Structure Image data set named EDU-CHEMC, which consists of totally 52,987 handwritten molecular structure images …☆12Updated 4 months ago
- ☆57Updated last year
- Chinese CLIP models with SOTA performance.☆58Updated 2 years ago
- WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia☆32Updated 3 months ago
- VimTS: A Unified Video and Image Text Spotter☆78Updated 11 months ago
- 用于学习GOT/Qwen/OnnxLLm☆53Updated last year
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆155Updated last year
- Our 2nd-gen LMM☆34Updated last year
- A Unified Toolkit for Deep Learning-Based Table Extraction☆51Updated 10 months ago
- ☆29Updated last year
- Cook up amazing multimodal AI applications effortlessly with MiniCPM-o☆202Updated last week
- [NAACL 2024] Visually Guided Generative Text-Layout Pre-training for Document Intelligence☆146Updated last year
- (ICCV 2025) OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation☆87Updated 3 months ago
- ☆15Updated 2 months ago
- ☆26Updated last week
- 基于TrOCR + UniMER-1M数据集,训练一个小而美的公式识别模型☆27Updated 3 months ago
- [arXiv: 2505.17163] OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning☆64Updated 2 months ago
- 利用Swin-Unet(Swin Transformer Unet)实现对文档图片里表格结构的识别,Swin-unet (Swin Transformer Unet) is used to identify the document table structure☆25Updated last year
- [EMNLP 2025 Demo] PresentAgent: Multimodal Agent for Presentation Video Generation☆103Updated this week