onealwj / MVLT
PyTorch implementation of the BMVC 2022 paper "Masked Vision-Language Transformers for Scene Text Recognition"
☆29 Updated 2 years ago
Alternatives and similar repositories for MVLT
Users who are interested in MVLT are comparing it to the repositories listed below
- ☆14 Updated 2 years ago
- ☆16 Updated 3 years ago
- The dataset used in the CVPR 2022 paper (SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Norm… ☆33 Updated 3 years ago
- ☆22 Updated 3 years ago
- ☆26 Updated last year
- [ACM MM 2020] Exploring Font-independent Features for Scene Text Recognition ☆44 Updated 4 years ago
- STIRER: A Unified Model for Low-Resolution Scene Text Image Recovery and Recognition -- ACMMM 2023 ☆13 Updated 7 months ago
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022 ☆13 Updated 2 years ago
- ☆30 Updated last year
- HHH ☆34 Updated 3 years ago
- ☆38 Updated last year
- ☆41 Updated 5 years ago
- [NeurIPS2021] BOVText: A Large-Scale, Multidimensional Multilingual Dataset for Video Text Spotting ☆68 Updated last year
- ☆25 Updated last year
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers ☆21 Updated 2 years ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction ☆29 Updated 2 years ago
- Official PyTorch implementation for Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features … ☆70 Updated 2 years ago
- Unconstrained Text Detection with Box Supervision and Dynamic Self-Training ☆33 Updated 2 years ago
- [MM2023] An official implementation of the paper "One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer" ☆15 Updated last year
- ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting ☆86 Updated 2 years ago
- PERT: A Progressively Region-based Network for Scene Text Removal (TIP2023) ☆35 Updated last year
- It's the code for the paper Pushing the Performance Limit of Scene Text Recognizer without Human Annotation, CVPR 2022. ☆28 Updated 3 years ago
- TIoU metric in python3. Forked from https://github.com/Yuliang-Liu/TIoU-metric. ☆26 Updated 5 years ago
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR) ☆83 Updated last year
- ☆29 Updated 2 years ago
- ☆42 Updated last year
- ☆41 Updated last year
- Vision-Language Pre-Training for Boosting Scene Text Detectors (CVPR2022) ☆12 Updated 3 years ago
- Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22) ☆102 Updated last year
- [AAAI 2024] SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression ☆66 Updated 4 months ago