PriNing/ODM

ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting

☆45

Alternatives and similar repositories for ODM

Users who are interested in ODM are comparing it to the repositories listed below.

bytedance / oclip
☆53 · Nov 4, 2022 · Updated 3 years ago
bytedance / SPTSv2
The official implementation of SPTS v2: Single-Point Text Spotting
☆138 · Jun 29, 2023 · Updated 3 years ago
retsuh-bqw / SRFormer-Text-Det
[AAAI 2024] SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression
☆70 · Feb 21, 2025 · Updated last year
TongkunGuan / SIGA
[CVPR2023] Self-supervised Implicit Glyph Attention for Text Recognition
☆110 · Mar 9, 2025 · Updated last year
wenwenyu / TCM
Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)
☆202 · Jun 17, 2024 · Updated 2 years ago
OPPO-Mente-Lab / GlyphDraw
Text-To-Image Generation with Chinese Characters
☆133 · Jul 20, 2023 · Updated 3 years ago
clovaai / units
☆78 · Aug 7, 2023 · Updated 2 years ago
luanshiyinyang / ChineseOCR
End-to-end Chinese scene text recognition.
☆12 · Jun 27, 2022 · Updated 4 years ago
ViTAE-Transformer / DeepSolo
The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++:…
☆294 · May 30, 2025 · Updated last year
bytedance / E2STR
The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer
☆55 · Jun 14, 2024 · Updated 2 years ago
ecnuljzhang / brush-your-text
☆99 · Jan 3, 2024 · Updated 2 years ago
ZYM-PKU / UDiffText
[ECCV 2024] Official repo for UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diff…
☆236 · Feb 14, 2025 · Updated last year
Token-family / TokenFD
[ICCV2025] A Token-level Text Image Foundation Model for Document Understanding
☆135 · Aug 27, 2025 · Updated 10 months ago
99Franklin / DiffText
☆16 · Jan 10, 2025 · Updated last year
mxin262 / ESTextSpotter
(ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
☆78 · Apr 9, 2024 · Updated 2 years ago
wzx99 / CLIPOCR
☆38 · Oct 20, 2023 · Updated 2 years ago
TenMilesLotus / DTSM
Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator
☆13 · Apr 28, 2024 · Updated 2 years ago
TongkunGuan / CCD
[ICCV2023] Self-supervised Character-to-Character Distillation for Text Recognition
☆153 · Jul 12, 2026 · Updated last week
xdxie / WAS_WordArt-Segmentation
The official codes and datasets for Artistic Text Segmentation (ECCV 2024).
☆30 · Sep 24, 2025 · Updated 9 months ago
DrLuo / RTM
The official repository of Real Text Manipulation (RTM)
☆46 · Mar 18, 2025 · Updated last year
FaltingsA / SSM
[IJCAI-2024] The official code of Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition
☆10 · Aug 10, 2025 · Updated 11 months ago
ymy-k / Hi-SAM
[IEEE TPAMI] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
☆365 · May 30, 2025 · Updated last year
Mountchicken / Union14M
[ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective
☆206 · Nov 1, 2023 · Updated 2 years ago
irisXcoding / DocReal
DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction
☆30 · Jun 28, 2023 · Updated 3 years ago
amazon-science / glass-text-spotting
Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)
☆102 · Jun 28, 2024 · Updated 2 years ago
UCSB-NLP-Chang / DiffSTE
☆102 · Aug 1, 2024 · Updated last year
large-ocr-model / large-ocr-model.github.io
☆189 · Feb 27, 2024 · Updated 2 years ago
buaacxf / VIPTR
☆44 · Jul 9, 2024 · Updated 2 years ago
shannanyinxiang / SPTS
Official implementation of SPTS: Single-Point Text Spotting (ACM MM 2022 Oral)
☆145 · Jul 26, 2023 · Updated 2 years ago
VamosC / CLIP4STR
An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".
☆152 · Nov 14, 2025 · Updated 8 months ago
omtcyang / iOPENText
resources for text detection, text recognition, and end to end text spotting
☆13 · Apr 23, 2023 · Updated 3 years ago
mxin262 / Bridging-Text-Spotting
(CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.
☆75 · Jun 11, 2024 · Updated 2 years ago
MODCT / Celery-LaTex-OCR
Another LaTex formula OCR tool
☆15 · Feb 15, 2023 · Updated 3 years ago
czczup / FAST
Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation
☆206 · May 23, 2025 · Updated last year
weichaozeng / TextCtrl
[2024-NeurIPS] TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
☆105 · Mar 16, 2025 · Updated last year
ZeningLin / PEneo
[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.
☆41 · Apr 7, 2025 · Updated last year
eezkni / ESIM
[TIP-2017] Official MATLAB implementation of the "ESIM: Edge Similarity for Screen Content Image Quality Assessment"
☆11 · May 30, 2026 · Updated last month
Code-of-Liujie / YOLOv8-QR
☆13 · Mar 25, 2024 · Updated 2 years ago
fabio-sim / DocShadow-ONNX-TensorRT
ONNX-compatible DocShadow: High-Resolution Document Shadow Removal. Supports TensorRT 🚀
☆25 · Sep 13, 2023 · Updated 2 years ago