GbotHQ/ocr-dataset-rendering

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/GbotHQ/ocr-dataset-rendering)

GbotHQ / ocr-dataset-rendering

☆39

Alternatives and similar repositories for ocr-dataset-rendering

Users that are interested in ocr-dataset-rendering are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

GbotHQ / Blender-3D-document-rendering-pipeline
View on GitHub
Render documents on a virtual paper with folds and other types of damage using blender geometry nodes.
☆27Aug 14, 2023Updated 2 years ago
lqzxt / NGTR
View on GitHub
☆14May 26, 2025Updated last year
guoxy25 / Ocean-OCR
View on GitHub
☆48Feb 7, 2025Updated last year
khumbuai / BERT-keras-minimal
View on GitHub
Keras BERT with pre-trained weights
☆10Feb 10, 2019Updated 7 years ago
Topdu / DocPTBench
View on GitHub
Benchmarking End-to-End Photographed Document Parsing and Translation
☆17Dec 4, 2025Updated 7 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Alpha-Innovator / DocParser
View on GitHub
☆18Jan 13, 2025Updated last year
TenMilesLotus / DTSM
View on GitHub
Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator
☆13Apr 28, 2024Updated 2 years ago
ParadoxZW / LLaVA-UHD-Better
View on GitHub
A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo
☆35Aug 12, 2024Updated last year
h2oai / doctr
View on GitHub
docTR by Mindee (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Lear…
☆11May 19, 2026Updated 2 months ago
yeezhu / UNIT
View on GitHub
PyTorch implementation of "UNIT: Unifying Image and Text Recognition in One Vision Encoder", NeurlPS 2024.
☆34Sep 26, 2024Updated last year
ainatersol / Vesuvius-InkDetection
View on GitHub
The Vesuvius Challenge is a machine learning and computer vision competition to read the Herculaneum Papyri.
☆15Aug 28, 2023Updated 2 years ago
MAEHCM / AET
View on GitHub
Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”
☆18Dec 6, 2022Updated 3 years ago
PkuDavidGuan / CurvedSynthText
View on GitHub
☆41Nov 30, 2019Updated 6 years ago
SCUT-DLVCLab / RFUND
View on GitHub
[MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…
☆21Dec 4, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
NExTplusplus / TAT-DQA
View on GitHub
TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning
☆24Sep 17, 2024Updated last year
ucaslcl / Fox
View on GitHub
official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"
☆196May 31, 2024Updated 2 years ago
nttmdlab-nlp / InstructDoc
View on GitHub
InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)
☆162May 31, 2024Updated 2 years ago
bytedance / E2STR
View on GitHub
The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer
☆55Jun 14, 2024Updated 2 years ago
huggingface / docmatix
View on GitHub
A huge dataset for Document Visual Question Answering
☆24Jul 29, 2024Updated last year
wangyuxin87 / Tampered-IC13
View on GitHub
-
☆24Oct 25, 2022Updated 3 years ago
clovaai / spade
View on GitHub
☆82Jun 12, 2023Updated 3 years ago
data-liberation / table-understanding-dataset
View on GitHub
table understanding dataset for comparative evaluation of different table understanding algorithms
☆13Jun 15, 2018Updated 8 years ago
milely / SRN.Pytorch
View on GitHub
Unofficial implementation of Towards Accurate Scene Text Recognition with Semantic Reasoning Networks
☆28Sep 24, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
SII-sc22mc / DocFusion
View on GitHub
A Unified Framework for Document Parsing Tasks (Including Document Layout Analysis, OCR, Formula Recognition, and Table Recognition)
☆15Jul 1, 2025Updated last year
Yuliang-Liu / MultimodalOCR
View on GitHub
On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)
☆873Updated this week
ajjimeno / icdar-task-b
View on GitHub
Repo
☆13Mar 7, 2022Updated 4 years ago
CyrilSterling / LPV
View on GitHub
The official code of Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition (IJCAI2023)
☆26Sep 3, 2023Updated 2 years ago
TongkunGuan / Text-Related-Papers
View on GitHub
Update the latest text-related papers from top conferences
☆31Mar 12, 2025Updated last year
Sanster / OhMyTable
View on GitHub
Table Structure Recognition
☆28Jul 25, 2024Updated last year
pengts / VW-LMM
View on GitHub
☆25May 13, 2024Updated 2 years ago
LingyvKong / OneChart
View on GitHub
[ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"
☆266Apr 14, 2025Updated last year
google-research-datasets / screen_qa
View on GitHub
ScreenQA dataset was introduced in the "ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots" paper. It contains ~86K …
☆151Feb 7, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Picsart-AI-Research / Social-Reward
View on GitHub
[ICLR 2024 Spotlight] Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Communi…
☆12Mar 29, 2024Updated 2 years ago
DrLuo / SemiETS
View on GitHub
【CVPR 2025】SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting
☆17Jul 1, 2025Updated last year
CSU-JPG / TextAtlas
View on GitHub
[ICML 2026]A Large-scale Dataset for training and evaluating model's ability on Dense Text Image Generation
☆93Sep 27, 2025Updated 9 months ago
bytedance / MTVQA
View on GitHub
MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…
☆64May 15, 2025Updated last year
Zhenhang-Li / GlyphOnly
View on GitHub
【2024 ECAI】First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending
☆14Jun 16, 2025Updated last year
wennyuhey / KD-DETR
View on GitHub
☆19Jun 25, 2025Updated last year
ManuelPalermo / AndroidVideoSegmentation
View on GitHub
Android video semantic segmentation using DeeplabV3+ lite
☆10Sep 20, 2019Updated 6 years ago