☆36Oct 7, 2023Updated 2 years ago
Alternatives and similar repositories for ocr-dataset-rendering
Users who are interested in ocr-dataset-rendering compare it to the libraries listed below
- Render documents on virtual paper with folds and other types of damage using Blender geometry nodes.☆26Aug 14, 2023Updated 2 years ago
- ☆48Feb 7, 2025Updated last year
- ☆13May 26, 2025Updated 9 months ago
- PyTorch implementation of "UNIT: Unifying Image and Text Recognition in One Vision Encoder", NeurIPS 2024.☆34Sep 26, 2024Updated last year
- Repo☆12Mar 7, 2022Updated 3 years ago
- A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo☆34Aug 12, 2024Updated last year
- [MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…☆20Dec 4, 2024Updated last year
- Code for the AAAI 2023 paper “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆18Dec 6, 2022Updated 3 years ago
- Youtu-Parsing: Perception, Structuring and Recognition via High-Parallelism Decoding☆57Feb 10, 2026Updated 3 weeks ago
- Convert equations inside Word (.docx) files to LaTeX☆24Oct 17, 2025Updated 4 months ago
- ☆20Apr 24, 2024Updated last year
- Synthetic identity documents dataset☆35Mar 4, 2025Updated last year
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…☆63May 15, 2025Updated 9 months ago
- This repository is a concise collection of well-known deep-learning-based document binarization models.☆27Dec 24, 2022Updated 3 years ago
- TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning☆23Sep 17, 2024Updated last year
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆260Apr 14, 2025Updated 10 months ago
- Camera-based Document Analysis☆26Jul 7, 2025Updated 7 months ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆30May 23, 2023Updated 2 years ago
- The official code of Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition (IJCAI2023)☆27Sep 3, 2023Updated 2 years ago
- The ScreenQA dataset was introduced in the "ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots" paper. It contains ~86K …☆139Feb 7, 2025Updated last year
- Advanced inference pipeline using NVIDIA Triton Inference Server for CRAFT text detection (PyTorch), including a converter from PyTorch -> O…☆33Aug 18, 2021Updated 4 years ago
- Update the latest text-related papers from top conferences☆27Mar 12, 2025Updated 11 months ago
- (CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.☆74Jun 11, 2024Updated last year
- Code for the paper "UVDoc: Neural Grid-based Document Unwarping" - Dataset capture and creation☆31May 27, 2024Updated last year
- ☆37May 7, 2023Updated 2 years ago
- Basic HTR concepts/modules to boost performance