The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer
☆55 · Jun 14, 2024 · Updated last year
Alternatives and similar repositories for E2STR
Users that are interested in E2STR are comparing it to the libraries listed below
- ☆42 · Sep 2, 2023 · Updated 2 years ago
- [ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective ☆201 · Nov 1, 2023 · Updated 2 years ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023 ☆28 · Jul 12, 2023 · Updated 2 years ago
- [ICLR 2026] OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning ☆73 · Dec 17, 2025 · Updated 2 months ago
- ☆27 · Feb 20, 2024 · Updated 2 years ago
- (CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting. ☆74 · Jun 11, 2024 · Updated last year
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022 ☆13 · Nov 25, 2022 · Updated 3 years ago
- Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator ☆12 · Apr 28, 2024 · Updated last year
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding ☆27 · Dec 18, 2025 · Updated 2 months ago
- The official implementation of SPTS v2: Single-Point Text Spotting ☆140 · Jun 29, 2023 · Updated 2 years ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition ☆29 · Nov 11, 2022 · Updated 3 years ago
- The official code of Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition (IJCAI2023) ☆27 · Sep 3, 2023 · Updated 2 years ago
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers ☆21 · Jul 26, 2022 · Updated 3 years ago
- ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting ☆45 · Apr 11, 2025 · Updated 10 months ago
- ☆188 · Feb 27, 2024 · Updated 2 years ago
- Code for the CVPR 2022 paper Pushing the Performance Limit of Scene Text Recognizer without Human Annotation. ☆28 · Jul 6, 2022 · Updated 3 years ago
- PyTorch CRNN with center loss to solve the near word problem ☆16 · Jan 27, 2022 · Updated 4 years ago
- Official PyTorch implementation of MRN: Multiplexed Routing Network for Incremental Multilingual Text Recognition (ICCV 2023) ☆51 · Sep 26, 2023 · Updated 2 years ago
- Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI) ☆200 · Jun 17, 2024 · Updated last year
- ☆19 · Mar 10, 2023 · Updated 2 years ago
- ☆78 · Aug 7, 2023 · Updated 2 years ago
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua… ☆63 · May 15, 2025 · Updated 9 months ago
- An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model". ☆146 · Nov 14, 2025 · Updated 3 months ago
- ☆87 · Mar 7, 2025 · Updated 11 months ago
- ☆44 · Jul 9, 2024 · Updated last year
- Data Programming for Text Detection in Documents using SPEAR ☆12 · Mar 26, 2025 · Updated 11 months ago
- [IJCAI-2024] The official code of Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition ☆10 · Aug 10, 2025 · Updated 6 months ago
- ☆38 · Oct 20, 2023 · Updated 2 years ago
- ☆100 · Jan 3, 2024 · Updated 2 years ago
- Official repository accompanying the ICDAR 2023 paper ☆13 · Oct 3, 2023 · Updated 2 years ago
- STRExp is a framework that provides Explainability (XAI) to Scene Text Recognition (STR) models. ☆11 · Nov 27, 2023 · Updated 2 years ago
- This repository is the codebase of TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy ☆50 · Oct 16, 2024 · Updated last year
- SAN: Structure-Aware Network for Complex and Long-tailed Chinese Text Recognition ☆10 · Apr 8, 2024 · Updated last year
- [TIM 2025] Towards Accurate Readings of Water Meters by Eliminating Transition Error: New Dataset and Effective Solution ☆12 · Mar 5, 2025 · Updated 11 months ago
- Official repository of the paper: "A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition" ☆26 · Jul 10, 2023 · Updated 2 years ago
- Datasets and Evaluation Scripts for CompHRDoc ☆56 · Feb 25, 2025 · Updated last year
- Official PyTorch implementation for Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features … ☆73 · Jun 24, 2023 · Updated 2 years ago
- Unofficial implementation of the paper "MetaHTR: Towards Writer-Adaptive Handwritten Text Recognition" by Bhunia et al. (2021). ☆13 · Jun 22, 2022 · Updated 3 years ago
- ☆13 · May 26, 2025 · Updated 9 months ago