raphael-baena / DTLR
Handwritten Text Recognition and Character Detection
☆146Updated this week
Alternatives and similar repositories for DTLR:
Users who are interested in DTLR are comparing it to the repositories listed below:
- [AAAI2025 Oral] Predicting the Original Appearance of Damaged Historical Documents☆74Updated last month
- [NAACL 2024] Visually Guided Generative Text-Layout Pre-training for Document Intelligence☆143Updated 7 months ago
- Official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆146Updated 10 months ago
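- 📝 Content extraction for document images, reproducing each document image one-to-one as Word or Txt output for further use or processing. Planned future support for PDF/image input with output in JSON, Txt, Word, and Markdown formats.☆191Updated 5 months ago
- Research on accelerating real-world deployment of the GOT-OCR project, not limited to any one language☆60Updated 6 months ago
- Layout analysis for Chinese and English documents☆201Updated last month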
- ☆124Updated last week
- A High-efficiency Open-source Toolkit for Table-to-Latex Task☆232Updated 4 months ago
- VimTS: A Unified Video and Image Text Spotter☆77Updated 5 months ago
- ☆86Updated 4 months ago
- Vary-tiny codebase built on LAVIS (for training from scratch) and a dataset of PDF image-text pairs (about 600k, including English and Chinese)☆79Updated 7 months ago
- The Learnable Typewriter: A Generative Approach to Text Line Analysis☆33Updated 5 months ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆214Updated 11 months ago
- ☆28Updated 2 months ago
- [AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization☆208Updated 2 weeks ago
- [ArXiv] PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling☆116Updated 6 months ago
- The official code for NeurIPS 2024 paper: Harmonizing Visual Text Comprehension and Generation☆119Updated 5 months ago
- Document Artificial Intelligence☆160Updated this week
- A simple MLLM that surpasses QwenVL-Max using only open-source data, with a 14B LLM.☆37Updated 7 months ago
- ☆56Updated last year
- A Unified Toolkit for Deep Learning-Based Table Extraction☆34Updated 5 months ago
- Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)☆51Updated 10 months ago