ViTAE-Transformer/ViTAE-Transformer-Scene-Text-Detection

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ViTAE-Transformer/ViTAE-Transformer-Scene-Text-Detection)

ViTAE-Transformer / ViTAE-Transformer-Scene-Text-Detection

A comprehensive list [Hi-SAM@TPAMI'24, GoMatching@NeurIPS'24, DeepSolo(++)@ CVPR'23, DPText-DETR@AAAI'23, I3CL@IJCV'22] of our research works related to scene text detection, spotting, etc., including papers, codes.

☆94

Alternatives and similar repositories for ViTAE-Transformer-Scene-Text-Detection

Users that are interested in ViTAE-Transformer-Scene-Text-Detection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ymy-k / DPText-DETR
View on GitHub
[AAAI'23 Oral] DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer
☆204Aug 31, 2023Updated 2 years ago
Hxyz-123 / GoMatching
View on GitHub
[NeurIPS'24] GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching
☆34May 29, 2025Updated last year
ViTAE-Transformer / SAMText
View on GitHub
The official repo for the technical report "Scalable Mask Annotation for Video Text Spotting"
☆16May 3, 2023Updated 3 years ago
LARS-research / TREFE
View on GitHub
Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022
☆13Nov 25, 2022Updated 3 years ago
amazon-science / glass-text-spotting
View on GitHub
Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)
☆102Jun 28, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ViTAE-Transformer / DeepSolo
View on GitHub
The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++:…
☆294May 30, 2025Updated last year
GXYM / TextPMs
View on GitHub
Arbitrary Shape Text Detection via Segmentation with Probability Maps; accepted by TPAMI2022
☆104Jun 30, 2023Updated 3 years ago
Wei-ucas / TPSNet
View on GitHub
☆28Nov 29, 2023Updated 2 years ago
Hxyz-123 / ReasoningOCR
View on GitHub
☆18Jul 24, 2025Updated last year
mlpc-ucsd / TESTR
View on GitHub
(CVPR 2022) Text Spotting Transformers
☆192Jan 30, 2023Updated 3 years ago
namtuanly / WikiTableSet
View on GitHub
WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia
☆32Jun 12, 2025Updated last year
wenwenyu / TCM
View on GitHub
Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)
☆202Jun 17, 2024Updated 2 years ago
HCIILAB / M5HisDoc
View on GitHub
☆34Dec 18, 2025Updated 7 months ago
iiclab / DecompST
View on GitHub
☆15Nov 26, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ViTAE-Transformer / ViTAE-Transformer
View on GitHub
The official repo for [NeurIPS'21] "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias" and [IJCV'22] "ViTAEv2: Vis…
☆279Apr 15, 2026Updated 3 months ago
Mountchicken / Union14M
View on GitHub
[ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective
☆206Nov 1, 2023Updated 2 years ago
shannanyinxiang / SPTS
View on GitHub
Official implementation of SPTS: Single-Point Text Spotting (ACM MM 2022 Oral)
☆145Jul 26, 2023Updated 3 years ago
onealwj / MVLT
View on GitHub
PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition
☆28Nov 11, 2022Updated 3 years ago
ymy-k / Hi-SAM
View on GitHub
[IEEE TPAMI] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
☆368May 30, 2025Updated last year
wzx99 / CLIPOCR
View on GitHub
☆38Oct 20, 2023Updated 2 years ago
DCGM / SoftCTC
View on GitHub
This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135
☆19Mar 7, 2023Updated 3 years ago
weijiawu / Polygon-free-Unconstrained-Scene-Text-Detection-with-Box-Annotations
View on GitHub
Unconstrained Text Detection with Box Supervisionand Dynamic Self-Training
☆34Nov 24, 2022Updated 3 years ago
czczup / FAST
View on GitHub
Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation
☆206May 23, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
JizhiziLi / RIM
View on GitHub
[CVPR 2023] Referring Image Matting
☆208Apr 17, 2023Updated 3 years ago
mxin262 / SwinTextSpotter
View on GitHub
Pytorch re-implementation of Paper: SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition (…
☆289Nov 29, 2024Updated last year
wangyuxin87 / PERT
View on GitHub
PERT: A Progressively Region-based Network for Scene Text Removal (TIP2023)
☆37Aug 11, 2023Updated 2 years ago
FangShancheng / ABINet-PP
View on GitHub
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting
☆90Feb 11, 2023Updated 3 years ago
simplify23 / TPS_PP
View on GitHub
Official Pytorch implementations of TPS++: Attention-Enhanced Thin-Plate Spline for Scene Text Recognition (IJCAI 2023）
☆42Aug 13, 2023Updated 2 years ago
Pay20Y / PIMNet
View on GitHub
☆16Jan 30, 2022Updated 4 years ago
gmuffiness / CRAFT-train
View on GitHub
CRAFT(Baek et al., 2019) model training code
☆53Aug 10, 2024Updated last year
whai362 / pan_pp.pytorch
View on GitHub
Official implementations of PSENet, PAN and PAN++.
☆453Mar 9, 2023Updated 3 years ago
Canjie-Luo / Real-300K
View on GitHub
The dataset used in the CVPR 2022 paper (SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Norm…
☆34Jun 21, 2022Updated 4 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
weijiawu / TransVTSpotter
View on GitHub
A new video text spotting framework with Transformer
☆82May 23, 2022Updated 4 years ago
MiliLab / LogicOCR
View on GitHub
[arXiv: 2505.12307] LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?
☆35Dec 1, 2025Updated 7 months ago
shengtao96 / CentripetalText
View on GitHub
☆29Aug 31, 2022Updated 3 years ago
CVI-SZU / STKM
View on GitHub
Self-attention based Text Knowledge Mining for Text Detection
☆47Mar 7, 2023Updated 3 years ago
Xiaomeng-Yang / STR_benchmark_cleansed
View on GitHub
☆14May 26, 2023Updated 3 years ago
DREAMXFAR / FCL-Net
View on GitHub
This is the pytorch implementation of FCL-Net, accepted by NN'2022.
☆15May 25, 2022Updated 4 years ago
mxin262 / ESTextSpotter
View on GitHub
(ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
☆78Apr 9, 2024Updated 2 years ago