willpat1213/OCR-Datasets

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/willpat1213/OCR-Datasets)

willpat1213 / OCR-Datasets

总结OCR领域的主流公开数据集，包含检测&识别、各种场景、各种语言的数据集，并提供数据集的相关信息及下载链接。

☆43

Alternatives and similar repositories for OCR-Datasets

Users that are interested in OCR-Datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

shuyansy / Visual-Text-Processing-survey
View on GitHub
The official project of paper "Visual Text Processing: A Comprehensive Review and Unified Evaluation""
☆103Oct 20, 2025Updated 9 months ago
wuys13 / Multi-Drug-Transfer-Learning
View on GitHub
Pre-clinical drug discovery faces the low efficiency dilemma. One of the reasons is the lack of cross-drug efficacy evaluation infrastruc…
☆14Dec 8, 2025Updated 7 months ago
shannanyinxiang / SPTS
View on GitHub
Official implementation of SPTS: Single-Point Text Spotting (ACM MM 2022 Oral)
☆145Jul 26, 2023Updated 3 years ago
Mountchicken / Text-Recognition-on-Cross-Domain-Datasets
View on GitHub
Improved Text recognition algorithms on different text domains like scene text, handwritten, document, Chinese/English, even ancient book…
☆80Feb 4, 2023Updated 3 years ago
Helen-Cheung / Baidu-AI-Challenge-Scene-Text-Removal
View on GitHub
☆15Feb 28, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
mxin262 / Bridging-Text-Spotting
View on GitHub
(CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.
☆75Jun 11, 2024Updated 2 years ago
jiang-junyao / CACIMAR
View on GitHub
cross-species analysis of cell identities, markers and regulations
☆13Jul 11, 2025Updated last year
ChenJiayi68 / DMTNet
View on GitHub
☆12Aug 15, 2024Updated last year
XingruiWang / wheres-waldo
View on GitHub
A Pytorch implementing of A Deep Learning approach to Template Matching. Usie Hypernet + VGG to match the templates.
☆13Dec 18, 2021Updated 4 years ago
Zhenhang-Li / GlyphOnly
View on GitHub
【2024 ECAI】First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending
☆14Jun 16, 2025Updated last year
TenMilesLotus / DTSM
View on GitHub
Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator
☆13Apr 28, 2024Updated 2 years ago
linnarsson-lab / chromograph
View on GitHub
Camiel's scATAC-seq analysis pipeline
☆14Apr 15, 2024Updated 2 years ago
ventolab / Human-Maternal-Fetal-Interface_MFI
View on GitHub
☆16Jan 24, 2023Updated 3 years ago
iiclab / DecompST
View on GitHub
☆15Nov 26, 2023Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
saqib22 / Table-Structure_Extraction-Bi-directional-GRU
View on GitHub
This repo contains the code for "Table Structure Extraction with Bi-directional Gated Recurrent Unit Networks", ICDAR 2019..
☆19Jul 13, 2023Updated 3 years ago
shannanyinxiang / ViTEraser
View on GitHub
Official implementation of ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining (AAAI 20…
☆66Jul 4, 2024Updated 2 years ago
mxin262 / SwinTextSpotter
View on GitHub
Pytorch re-implementation of Paper: SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition (…
☆289Nov 29, 2024Updated last year
PriNing / ODM
View on GitHub
ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting
☆45Apr 11, 2025Updated last year
chiaoooo / PngToTTF
View on GitHub
切割手寫 png 打包 ttf
☆14Aug 4, 2024Updated last year
BIMSBbioinfo / bavaria
View on GitHub
Batch-adversarial variational auto-encoder (BAVARIA) for simultaneous dimensionality reduction and integration of single-cell ATAC-seq da…
☆16May 15, 2023Updated 3 years ago
DBinary / STARNet_tutorials
View on GitHub
Repository for the tutorials of STARNet, a computational framework designed to decipher spatially specific gene regulatory networks (GRNs…
☆15Jun 9, 2026Updated last month
FangShancheng / ABINet-PP
View on GitHub
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting
☆90Feb 11, 2023Updated 3 years ago
fuxialexander / genomespy
View on GitHub
☆15Jan 7, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
plutoyuxie / toonify-game-character
View on GitHub
☆18Mar 19, 2021Updated 5 years ago
chriscainx / gastric-cancer
View on GitHub
☆15Mar 2, 2023Updated 3 years ago
AndreasAZiegler / LCEBDFTB
View on GitHub
Low-Computation Egocentric Barcode Detector for the Blind
☆10Jun 9, 2017Updated 9 years ago
jiangnanboy / table_structure_recognition
View on GitHub
利用Swin-Unet(Swin Transformer Unet)实现对文档图片里表格结构的识别，Swin-unet (Swin Transformer Unet) is used to identify the document table structure
☆27Feb 23, 2024Updated 2 years ago
VinAIResearch / HyperCUT
View on GitHub
HyperCUT: Video Sequence from a Single Blurry Image using Unsupervised Ordering (CVPR'23)
☆14Nov 4, 2025Updated 8 months ago
deepkyu / ml-talking-face
View on GitHub
Cloned repository from Hugging Face Spaces (CVPR 2022 Demo)
☆54Sep 29, 2022Updated 3 years ago
peterWon / GLoc3D
View on GitHub
Implementation of our paper "Global Localization in Large-scale Point Clouds via Roll-pitch-yaw Invariant Place Recognition and Low-overl…
☆10Nov 25, 2023Updated 2 years ago
nxp-imx / tflite-vx-delegate-imx
View on GitHub
Tflite VX Delegate i.MX Machine Learning
☆13Jul 20, 2026Updated last week
songyiren98 / CLIPFont
View on GitHub
Implementation of paper: CLIPFont: Texture Guided Vector WordArt Generation
☆18Oct 8, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
huanranchen / ASRNorm
View on GitHub
a pytorch implement of Adversarially Adaptive Normalization for Single Domain Generalization
☆15Jul 25, 2023Updated 3 years ago
SimpleVQA / SimpleVQA
View on GitHub
SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models
☆15Feb 20, 2025Updated last year
tao-bai / ntu-poster-latex-template
View on GitHub
unofficial NTU academic poster latex template.
☆16Jul 12, 2021Updated 5 years ago
shbyun080 / OneNet
View on GitHub
Official Implementation of OneNet
☆22Oct 16, 2025Updated 9 months ago
captcha-recognition / ocr_dataset
View on GitHub
常用的ocr数据集
☆16Nov 15, 2021Updated 4 years ago
zigzagcai / varlen_mamba
View on GitHub
Mamba SSM architecture that supports training on variable-length sequences
☆12Sep 1, 2025Updated 10 months ago
INFINIQ-AI1 / CLIPVQDiffusion
View on GitHub
official implementation of "CLIP-VQDiffusion : Langauge Free Training of Text To Image generation using CLIP and vector quantized diffusi…
☆19Sep 5, 2024Updated last year