Official code for the paper: "Perception and Semantic Aware Regularization for Sequential Confidence Calibration (CVPR2023)"
☆10May 15, 2024Updated last year
Alternatives and similar repositories for PSSR
Users that are interested in PSSR are comparing it to the libraries listed below.
- ☆27Feb 20, 2024Updated 2 years ago
- Code for the paper "Pushing the Performance Limit of Scene Text Recognizer without Human Annotation", CVPR 2022.☆28Jul 6, 2022Updated 3 years ago
- ☆14May 26, 2023Updated 2 years ago
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition☆28Aug 29, 2023Updated 2 years ago
- [ICLR 2023] "Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations", Ziyu Jian…☆24Feb 16, 2023Updated 3 years ago
- Crafting Adversarial Examples for Neural Machine Translation☆10Apr 7, 2023Updated 2 years ago
- (ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer☆78Apr 9, 2024Updated last year
- ☆32Aug 14, 2023Updated 2 years ago
- Repository sharing code and the model for the paper "Rescoring Sequence-to-Sequence Models for Text Line Recognition with CTC-Prefixes"☆17Oct 13, 2021Updated 4 years ago
- RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering☆10Nov 27, 2022Updated 3 years ago
- ☆19Mar 10, 2023Updated 3 years ago
- [MM2023] An official implementation of the paper "One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer"☆16Nov 3, 2023Updated 2 years ago
- A clone from Max Jaderberg's Text Renderer☆34Jun 16, 2016Updated 9 years ago
- Code for MICCAI2023 paper: TransLiver: A Hybrid Transformer Model for Multi-phase Liver Lesion Classification☆18Jan 10, 2024Updated 2 years ago
- Local Temperature Scaling for Probability Calibration☆22Nov 26, 2021Updated 4 years ago
- Attention-based sequence-to-sequence model for handwritten word recognition☆64Sep 22, 2024Updated last year
- Scene text rectification using glyph and character alignment properties☆22Jan 21, 2018Updated 8 years ago
- ☆16Jan 30, 2022Updated 4 years ago
- Code for the paper "Reliability in Semantic Segmentation: Are We on the Right Track?", CVPR 2023☆23Jul 8, 2024Updated last year
- ☆44Jul 9, 2024Updated last year
- ☆38Oct 20, 2023Updated 2 years ago
- 4th-place solution for the Intelligent Transportation CV Model track of the 2nd Guangzhou Pazhou Algorithm Competition☆11Aug 9, 2023Updated 2 years ago
- H. Zhang, Q. Yao, M. Yang, Y. Xu, X. Bai. AutoSTR: Efficient Backbone Search for Scene Text Recognition. European Conference on Computer …☆84Aug 6, 2020Updated 5 years ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 3 years ago
- [ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"☆110Dec 4, 2024Updated last year
- ☆12Mar 7, 2019Updated 7 years ago
- ☆12Jun 11, 2023Updated 2 years ago
- Code for "Optimizing risk-based breast cancer screening policies with reinforcement learning"☆24Jan 13, 2022Updated 4 years ago
- A Better Way to Attend: Attention with Trees for Video Question Answering☆25Mar 25, 2019Updated 7 years ago
- ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting☆91Feb 11, 2023Updated 3 years ago
- Attention-based sampler in TASN (Trilinear Attention Sampling Network)☆23Jun 8, 2020Updated 5 years ago
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers☆21Jul 26, 2022Updated 3 years ago
- ☆12Aug 25, 2017Updated 8 years ago
- A third-party implementation of the Encoder Dual Decoder method for table recognition☆13Aug 25, 2021Updated 4 years ago
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Feb 5, 2024Updated 2 years ago
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆45Apr 3, 2024Updated last year
- An Open-access Dataset for Liver Lesion Diagnosis on Multi-phase MRI☆37Apr 7, 2025Updated 11 months ago
- A PyTorch implementation of "From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network" (ICCV2021)☆105Dec 9, 2021Updated 4 years ago
- Implementation of LaTr: Layout-aware transformer for scene-text VQA, a novel multimodal architecture for Scene Text Visual Question Answer…☆56Oct 30, 2024Updated last year