mxin262/Bridging-Text-Spotting

(CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.

☆75

Alternatives and similar repositories for Bridging-Text-Spotting

Users who are interested in Bridging-Text-Spotting are comparing it to the libraries listed below.

mxin262 / ESTextSpotter
(ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
☆78 · Apr 9, 2024 · Updated 2 years ago
Yuliang-Liu / SPTSv2
☆22 · May 30, 2023 · Updated 3 years ago
wenwenyu / TCM
Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)
☆202 · Jun 17, 2024 · Updated 2 years ago
Mountchicken / Union14M
[ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective
☆206 · Nov 1, 2023 · Updated 2 years ago
Yuliang-Liu / VimTS
VimTS: A Unified Video and Image Text Spotter
☆79 · Nov 10, 2024 · Updated last year
Wei-ucas / TPSNet
☆28 · Nov 29, 2023 · Updated 2 years ago
SCUT-DLVCLab / RFUND
[MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…
☆21 · Dec 4, 2024 · Updated last year
amazon-science / glass-text-spotting
Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)
☆102 · Jun 28, 2024 · Updated 2 years ago
MelosY / CAM
☆27 · Feb 20, 2024 · Updated 2 years ago
lcy0604 / CTRNet
This repository is the implementation of "Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Contex…
☆97 · Feb 21, 2023 · Updated 3 years ago
mlpc-ucsd / TESTR
(CVPR 2022) Text Spotting Transformers
☆192 · Jan 30, 2023 · Updated 3 years ago
ymy-k / DPText-DETR
[AAAI'23 Oral] DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer
☆204 · Aug 31, 2023 · Updated 2 years ago
buaacxf / VIPTR
☆44 · Jul 9, 2024 · Updated 2 years ago
byeonghu-na / MATRN
Official PyTorch implementation for Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features …
☆74 · Jun 24, 2023 · Updated 3 years ago
bytedance / E2STR
The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer
☆55 · Jun 14, 2024 · Updated 2 years ago
ayumiymk / DiG
Official PyTorch implementation of `Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition`
☆74 · Feb 27, 2023 · Updated 3 years ago
ViTAE-Transformer / DeepSolo
The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++:…
☆294 · May 30, 2025 · Updated last year
mxin262 / SwinTextSpotter
Pytorch re-implementation of Paper: SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition (…
☆289 · Nov 29, 2024 · Updated last year
SCUT-DLVCLab / OCR-Reasoning
[ICLR 2026] OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning
☆76 · May 26, 2026 · Updated last month
shannanyinxiang / SPTS
Official implementation of SPTS: Single-Point Text Spotting (ACM MM 2022 Oral)
☆145 · Jul 26, 2023 · Updated 2 years ago
zhangyifei01 / LMIM
Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition
☆15 · Oct 26, 2025 · Updated 8 months ago
99Franklin / DiffText
☆16 · Jan 10, 2025 · Updated last year
google-research-datasets / hiertext
The HierText dataset contains ~12k images from the Open Images dataset v6 with large amount of text entities. We provide word, line and p…
☆316 · Dec 2, 2024 · Updated last year
ymy-k / Hi-SAM
[IEEE TPAMI] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
☆366 · May 30, 2025 · Updated last year
shannanyinxiang / UPOCR
Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)
☆69 · Jun 6, 2024 · Updated 2 years ago
bytedance / SPTSv2
The official implementation of SPTS v2: Single-Point Text Spotting
☆138 · Jun 29, 2023 · Updated 3 years ago
shannanyinxiang / PageNet
Official implementation of PageNet (IJCV 2022)
☆82 · Oct 31, 2022 · Updated 3 years ago
csguoh / KD-LTR
[MM2023] An official implement of the paper "One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer"
☆16 · Nov 3, 2023 · Updated 2 years ago
OPPO-Mente-Lab / GlyphDraw
Text-To-Image Generation with Chinese Characters
☆133 · Jul 20, 2023 · Updated 3 years ago
weijiawu / TransDETR
[IJCV 2024] TransDETR: End-to-end Video Text Spotting with Transformer
☆114 · Mar 28, 2024 · Updated 2 years ago
facebookresearch / MultiplexedOCR
Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR
☆80 · Dec 2, 2022 · Updated 3 years ago
ZZZHANG-jx / DocKylin
[AAAI 2025] DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming
☆36 · Jun 1, 2025 · Updated last year
TenMilesLotus / DTSM
Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator
☆13 · Apr 28, 2024 · Updated 2 years ago
mrlooi / convert_to_coco
Scripts for converting various datasets to MSCOCO annotation (json) files
☆12 · Jun 5, 2019 · Updated 7 years ago
HCIILAB / Scene-Text-Recognition-Recommendations
Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining
☆353 · Nov 29, 2023 · Updated 2 years ago
SCUT-DLVCLab / GPT-4V_OCR
Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)
☆128 · Nov 13, 2023 · Updated 2 years ago
duxiangcheng / SAEN
Modeling Stroke Mask for End-to-End Text Erasing
☆20 · Feb 9, 2023 · Updated 3 years ago
DrLuo / SemiETS
【CVPR 2025】SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting
☆17 · Jul 1, 2025 · Updated last year
SCUT-DLVCLab / SCUT-EnsExam
SCUT-EnsExam is a real-world handwritten text erasure dataset for examination paper scenarios, which consists of 545 examination paper im…
☆21 · Jul 17, 2026 · Updated last week