FaltingsA / SSMLinks
[IJCAI-2024] The official code of Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition
☆10Updated 2 months ago
Alternatives and similar repositories for SSM
Users that are interested in SSM are comparing it to the libraries listed below
Sorting:
- [ICCV2023] Self-supervised Character-to-Character Distillation for Text Recognition☆151Updated last year
- [ECCV2024] Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors☆19Updated last year
- [CVPR2023] Self-supervised Implicit Glyph Attention for Text Recognition☆108Updated 7 months ago
- Update the latest text-related papers from top conferences☆26Updated 7 months ago
- ☆21Updated 9 months ago
- Transferable Decoding with Visual Entities for Zero-Shot Image Captioning, ICCV 2023☆160Updated last year
- ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting☆43Updated 6 months ago
- Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval --ICCV2023 Oral☆91Updated last year
- [ICCV 2023] Few shot font generation via transferring similarity guided global and quantization local styles☆146Updated last month
- [TCSVT2022] Industria Scene Text Detection☆82Updated 2 years ago
- Balanced Classification: A Unified Framework for Long-Tailed Object Detection (TMM 2023)☆100Updated 6 months ago
- ☆16Updated 2 months ago
- [NeurIPS'24] GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching☆26Updated 4 months ago
- What Is a Good Caption? A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Thoroughness☆24Updated 5 months ago
- [CVPR 2025] PyTorch implementation of Diff-II☆19Updated 7 months ago
- [CVPR 2024] SimDA: Simple Diffusion Adapter for Efficient Video Generation☆129Updated last year
- [AAAI 2024] TagCLIP: A Local-to-Global Framework to Enhance Open-Vocabulary Multi-Label Classification of CLIP Without Training☆103Updated last year
- [ECCV2024] PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer☆82Updated 6 months ago
- MomentDiff: Generative Video Moment Retrieval from Random to Real--NeurIPS 2023☆80Updated last year
- [ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions