A comprehensive list [Hi-SAM@TPAMI'24, GoMatching@NeurIPS'24, DeepSolo(++)@ CVPR'23, DPText-DETR@AAAI'23, I3CL@IJCV'22] of our research works related to scene text detection, spotting, etc., including papers, codes.
☆93Nov 12, 2024Updated last year
Alternatives and similar repositories for ViTAE-Transformer-Scene-Text-Detection
Users that are interested in ViTAE-Transformer-Scene-Text-Detection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [AAAI'23 Oral] DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer☆200Aug 31, 2023Updated 2 years ago
- The official repo for the technical report "Scalable Mask Annotation for Video Text Spotting"☆16May 3, 2023Updated 2 years ago
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022☆13Nov 25, 2022Updated 3 years ago
- Arbitrary Shape Text Detection via Segmentation with Probability Maps; accepted by TPAMI2022☆104Jun 30, 2023Updated 2 years ago
- Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)☆102Jun 28, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆27Nov 29, 2023Updated 2 years ago
- The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++:…☆284May 30, 2025Updated 9 months ago
- [arXiv: 2505.12307] LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?☆35Dec 1, 2025Updated 3 months ago
- ☆31Dec 18, 2025Updated 3 months ago
- [NeurIPS'24] GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching☆31May 29, 2025Updated 9 months ago
- (CVPR 2022) Text Spotting Transformers☆190Jan 30, 2023Updated 3 years ago
- Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)☆201Jun 17, 2024Updated last year
- ☆15Nov 26, 2023Updated 2 years ago
- [ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective☆202Nov 1, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The official repo for [NeurIPS'21] "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias" and [IJCV'22] "ViTAEv2: Vis…☆281Sep 25, 2025Updated 6 months ago
- WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia☆32Jun 12, 2025Updated 9 months ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Nov 11, 2022Updated 3 years ago
- Official implementation of SPTS: Single-Point Text Spotting (ACM MM 2022 Oral)☆144Jul 26, 2023Updated 2 years ago
- ☆38Oct 20, 2023Updated 2 years ago
- Unconstrained Text Detection with Box Supervisionand Dynamic Self-Training☆34Nov 24, 2022Updated 3 years ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 3 years ago
- Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation☆205May 23, 2025Updated 10 months ago
- [IEEE TPAMI] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation☆350May 30, 2025Updated 9 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Pytorch re-implementation of Paper: SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition (…☆288Nov 29, 2024Updated last year
- [CVPR 2023] Referring Image Matting☆208Apr 17, 2023Updated 2 years ago
- PERT: A Progressively Region-based Network for Scene Text Removal (TIP2023)☆37Aug 11, 2023Updated 2 years ago
- ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting☆90Feb 11, 2023Updated 3 years ago
- Official Pytorch implementations of TPS++: Attention-Enhanced Thin-Plate Spline for Scene Text Recognition (IJCAI 2023)☆43Aug 13, 2023Updated 2 years ago
- ☆16Jan 30, 2022Updated 4 years ago
- The dataset used in the CVPR 2022 paper (SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Norm…☆34Jun 21, 2022Updated 3 years ago
- Official implementations of PSENet, PAN and PAN++.☆452Mar 9, 2023Updated 3 years ago
- A new video text spotting framework with Transformer☆81May 23, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- CRAFT(Baek et al., 2019) model training code☆52Aug 10, 2024Updated last year
- ☆29Aug 31, 2022Updated 3 years ago
- Self-attention based Text Knowledge Mining for Text Detection☆47Mar 7, 2023Updated 3 years ago
- [MM2023] An official implement of the paper "One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer"☆16Nov 3, 2023Updated 2 years ago
- This is the pytorch implementation of FCL-Net, accepted by NN'2022.☆14May 25, 2022Updated 3 years ago
- ☆18Jul 24, 2025Updated 8 months ago
- STIRER: A Unified Model for Low-Resolution Scene Text Image Recovery and Recognition -- ACMMM 2023☆14Dec 2, 2024Updated last year