FaltingsA / SSMLinks
[IJCAI-2024] The official code of Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition
☆10Updated last month
Alternatives and similar repositories for SSM
Users that are interested in SSM are comparing it to the libraries listed below
Sorting:
- [ICCV2023] Self-supervised Character-to-Character Distillation for Text Recognition☆151Updated last year
- [CVPR2023] Self-supervised Implicit Glyph Attention for Text Recognition☆108Updated 6 months ago
- [ECCV2024] Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors☆19Updated last year
- Update the latest text-related papers from top conferences☆26Updated 6 months ago
- ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting☆43Updated 5 months ago
- [ICCV 2023] Few shot font generation via transferring similarity guided global and quantization local styles☆145Updated 3 weeks ago
- ☆20Updated 9 months ago
- Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval --ICCV2023 Oral☆91Updated last year
- Transferable Decoding with Visual Entities for Zero-Shot Image Captioning, ICCV 2023☆158Updated last year
- [TCSVT2022] Industria Scene Text Detection☆81Updated 2 years ago
- [arXiv 25] Aesthetics is Cheap, Show me the Text: An Empirical Evaluation of State-of-the-Art Generative Models for OCR☆227Updated 3 weeks ago
- What Is a Good Caption? A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Thoroughness☆23Updated 4 months ago
- 【2024 ECAI】First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending☆14Updated 3 months ago
- [CVPR 2024] SimDA: Simple Diffusion Adapter for Efficient Video Generation☆129Updated last year
- ☆16Updated 2 months ago
- [NeurIPS'24] GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching☆26Updated 3 months ago
- [PR 2025] The official GitHub page of "MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Ca…☆64Updated 2 months ago
- Balanced Classification: A Unified Framework for Long-Tailed Object Detection (TMM 2023)☆100Updated 5 months ago
- [2024-NeurIPS] TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control☆90Updated 6 months ago
- New generation of CLIP with fine grained discrimination capability, ICML2025☆299Updated 2 weeks ago
- The official project of paper "Visual Text Processing: A Comprehensive Review and Unified Evaluation""☆80Updated 3 months ago
- The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer☆53Updated last year
- [CVPR 2025] PyTorch implementation of Diff-II☆16Updated 6 months ago
- ☆39Updated last year
- (CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.☆68Updated last year
- MomentDiff: Generative Video Moment Retrieval from Random to Real--NeurIPS 2023☆80Updated last year
- [ICCV2025] A Token-level Text Image Foundation Model for Document Understanding☆120Updated 3 weeks ago
- Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)☆198Updated last year
- [ACL 2025 main] The official GitHub page of "Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document Restorati…☆43Updated this week
- Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)☆63Updated last year