retsuh-bqw/SRFormer-Text-Det

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/retsuh-bqw/SRFormer-Text-Det)

retsuh-bqw / SRFormer-Text-Det

[AAAI 2024] SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression

☆70

Alternatives and similar repositories for SRFormer-Text-Det

Users that are interested in SRFormer-Text-Det are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

D641593 / MixNet
View on GitHub
☆92Feb 9, 2025Updated last year
ymy-k / DPText-DETR
View on GitHub
[AAAI'23 Oral] DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer
☆204Aug 31, 2023Updated 2 years ago
ychensu / LRANet
View on GitHub
[AAAI'24 Oral] LRANet: Towards Accurate and Efficient Scene Text Detection with Low-Rank Approximation Network
☆48Nov 28, 2025Updated 7 months ago
wenwenyu / TCM
View on GitHub
Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)
☆202Jun 17, 2024Updated 2 years ago
bytedance / SPTSv2
View on GitHub
The official implementation of SPTS v2: Single-Point Text Spotting
☆138Jun 29, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Mountchicken / Union14M
View on GitHub
[ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective
☆206Nov 1, 2023Updated 2 years ago
GXYM / TextBPN
View on GitHub
Adaptive Boundary Proposal Network for Arbitrary Shape Text Detection； Accepted by ICCV2021；The paper at: http://arxiv.org/abs/2107.12664
☆118Jun 30, 2023Updated 3 years ago
ViTAE-Transformer / DeepSolo
View on GitHub
The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++:…
☆294May 30, 2025Updated last year
byeonghu-na / MATRN
View on GitHub
Official PyTorch implementation for Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features …
☆74Jun 24, 2023Updated 3 years ago
czczup / FAST
View on GitHub
Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation
☆206May 23, 2025Updated last year
HCIILAB / Scene-Text-Recognition-Recommendations
View on GitHub
Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining
☆353Nov 29, 2023Updated 2 years ago
mxin262 / ESTextSpotter
View on GitHub
(ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
☆78Apr 9, 2024Updated 2 years ago
PriNing / ODM
View on GitHub
ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting
☆45Apr 11, 2025Updated last year
shannanyinxiang / UPOCR
View on GitHub
Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)
☆69Jun 6, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
namtuanly / WikiTableSet
View on GitHub
WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia
☆32Jun 12, 2025Updated last year
shannanyinxiang / ViTEraser
View on GitHub
Official implementation of ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining (AAAI 20…
☆66Jul 4, 2024Updated 2 years ago
clovaai / units
View on GitHub
☆78Aug 7, 2023Updated 2 years ago
GXYM / TextPMs
View on GitHub
Arbitrary Shape Text Detection via Segmentation with Probability Maps; accepted by TPAMI2022
☆104Jun 30, 2023Updated 3 years ago
wzx99 / TMIM
View on GitHub
☆13Oct 17, 2024Updated last year
FelixHertlein / inv3d-model
View on GitHub
Inference, training and evaluation code for our models from the paper "Inv3D: a high-resolution 3D invoice dataset for template-guided si…
☆61Feb 7, 2024Updated 2 years ago
zzyhlyoko / DCTC
View on GitHub
☆42Sep 2, 2023Updated 2 years ago
mrlooi / convert_to_coco
View on GitHub
Scripts for converting various datasets to MSCOCO annotation (json) files
☆12Jun 5, 2019Updated 7 years ago
google-research-datasets / hiertext
View on GitHub
The HierText dataset contains ~12k images from the Open Images dataset v6 with large amount of text entities. We provide word, line and p…
☆315Dec 2, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
DrLuo / SemiETS
View on GitHub
【CVPR 2025】SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting
☆17Jul 1, 2025Updated last year
GXYM / TextBPN-Plus-Plus
View on GitHub
Arbitrary Shape Text Detection via Boundary Transformer；The paper at: https://arxiv.org/abs/2205.05320, which has been accepted by IEEE T…
☆203Nov 5, 2025Updated 8 months ago
mlpc-ucsd / TESTR
View on GitHub
(CVPR 2022) Text Spotting Transformers
☆192Jan 30, 2023Updated 3 years ago
wzx99 / CLIPOCR
View on GitHub
☆38Oct 20, 2023Updated 2 years ago
corleonechensiyu / pytorch_AdvancedEast
View on GitHub
pytorch实现AdvancedEast+mobilenetv3
☆26Dec 25, 2019Updated 6 years ago
SJTU-DeepVisionLab / FreeReal
View on GitHub
[ECCV2024] Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors
☆19Sep 7, 2024Updated last year
VamosC / CLIP4STR
View on GitHub
An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".
☆152Nov 14, 2025Updated 8 months ago
yeungchenwa / OCR-SAM
View on GitHub
[Open-Source Project] Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize and segment text instance…
☆590Jan 30, 2024Updated 2 years ago
TencentARC / BTS
View on GitHub
BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild
☆33Apr 16, 2024Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
Gyann-z / FDP
View on GitHub
☆16Apr 21, 2025Updated last year
SCUT-DLVCLab / OCR-Reasoning
View on GitHub
[ICLR 2026] OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning
☆76May 26, 2026Updated last month
yflv-yanxia / scene_text
View on GitHub
☆173Apr 25, 2024Updated 2 years ago
FudanVI / FudanOCR
View on GitHub
A toolbox of scene text super-resolution and recognition
☆436Jul 25, 2024Updated last year
PFCCLab / StyleText
View on GitHub
Style-Text data synthesis tool
☆80Dec 9, 2024Updated last year
amazon-science / glass-text-spotting
View on GitHub
Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)
☆102Jun 28, 2024Updated 2 years ago
TongkunGuan / CCD
View on GitHub
[ICCV2023] Self-supervised Character-to-Character Distillation for Text Recognition
☆153Jul 12, 2026Updated last week