(CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.
☆74Jun 11, 2024Updated last year
Alternatives and similar repositories for Bridging-Text-Spotting
Users that are interested in Bridging-Text-Spotting are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- (ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer☆78Apr 9, 2024Updated last year
- ☆22May 30, 2023Updated 2 years ago
- Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)☆201Jun 17, 2024Updated last year
- [ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective☆202Nov 1, 2023Updated 2 years ago
- VimTS: A Unified Video and Image Text Spotter☆78Nov 10, 2024Updated last year
- ☆27Nov 29, 2023Updated 2 years ago
- [MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…☆20Dec 4, 2024Updated last year
- Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)☆102Jun 28, 2024Updated last year
- This repository is the implementation of "Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Contex…☆96Feb 21, 2023Updated 3 years ago
- (CVPR 2022) Text Spotting Transformers☆190Jan 30, 2023Updated 3 years ago
- ☆27Feb 20, 2024Updated 2 years ago
- Official PyTorch implementation for Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features …☆74Jun 24, 2023Updated 2 years ago
- The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer☆55Jun 14, 2024Updated last year
- ☆44Jul 9, 2024Updated last year
- Official PyTorch implementation of `Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition`☆74Feb 27, 2023Updated 3 years ago
- The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++:…☆284May 30, 2025Updated 9 months ago
- [arXiv: 2505.12307] LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?☆35Dec 1, 2025Updated 3 months ago
- Pytorch re-implementation of Paper: SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition (…☆288Nov 29, 2024Updated last year
- [NeurIPS'24] GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching☆31May 29, 2025Updated 9 months ago
- [AAAI 2025] DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming