husterpzh / PSSR
Official code for the paper: "Perception and Semantic Aware Regularization for Sequential Confidence Calibration (CVPR2023)"
☆10Updated last year
Alternatives and similar repositories for PSSR
Users that are interested in PSSR are comparing it to the libraries listed below
- ☆14Updated 2 years ago
- ☆10Updated last year
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆18Updated 2 years ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Updated 2 years ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Updated 2 years ago
- STIRER: A Unified Model for Low-Resolution Scene Text Image Recovery and Recognition -- ACMMM 2023☆13Updated 6 months ago
- ☆13Updated 5 months ago
- Geometric Augmentation for Text Image☆9Updated 5 years ago
- ☆25Updated last year
- Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`☆17Updated last year
- ☆16Updated 3 years ago
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022☆13Updated 2 years ago
- [MM2023] An official implementation of the paper "One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer"☆15Updated last year
- Unconstrained Text Detection with Box Supervision and Dynamic Self-Training☆33Updated 2 years ago
- Vision-Language Pre-Training for Boosting Scene Text Detectors (CVPR2022)☆12Updated 3 years ago
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding☆15Updated 3 months ago
- ☆26Updated last year
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers☆21Updated 2 years ago
- ☆30Updated last year
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆29Updated 2 years ago
- ☆38Updated 2 years ago
- ☆22Updated 3 years ago
- ☆18Updated 3 years ago
- ☆18Updated 2 years ago
- ☆24Updated last year
- [arXiv: 2505.12307] LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?☆22Updated last month
- ☆38Updated last year
- [FGVC9-CVPR 2022] The second place solution for 2nd eBay eProduct Visual Search Challenge.☆26Updated 2 years ago
- (ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer☆76Updated last year
- ☆40Updated 11 months ago