bytedance/E2STR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bytedance/E2STR)

bytedance / E2STR

The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer

☆55

Alternatives and similar repositories for E2STR

Users that are interested in E2STR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zzyhlyoko / DCTC
View on GitHub
☆42Sep 2, 2023Updated 2 years ago
Mountchicken / Union14M
View on GitHub
[ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective
☆206Nov 1, 2023Updated 2 years ago
Alpha-Innovator / DocParser
View on GitHub
☆18Jan 13, 2025Updated last year
ReverseSystem001 / crnn_centerloss_pytorch
View on GitHub
pytorch crnn with centerloss to solve the near word problem
☆16Jan 27, 2022Updated 4 years ago
dali92002 / SSL-OCR
View on GitHub
Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023
☆30Jul 12, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
bytedance / SPTSv2
View on GitHub
The official implementation of SPTS v2: Single-Point Text Spotting
☆138Jun 29, 2023Updated 3 years ago
mxin262 / Bridging-Text-Spotting
View on GitHub
(CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.
☆75Jun 11, 2024Updated 2 years ago
large-ocr-model / large-ocr-model.github.io
View on GitHub
☆189Feb 27, 2024Updated 2 years ago
clovaai / units
View on GitHub
☆78Aug 7, 2023Updated 2 years ago
tobiasvanderwerff / MetaHTR
View on GitHub
Unofficial implementation of the paper "MetaHTR: Towards Writer-Adaptive Handwritten Text Recognition" by Bhunia et al. (2021).
☆14Jun 22, 2022Updated 4 years ago
TenMilesLotus / DTSM
View on GitHub
Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator
☆13Apr 28, 2024Updated 2 years ago
bytedance / MTVQA
View on GitHub
MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…
☆64May 15, 2025Updated last year
wenwenyu / TCM
View on GitHub
Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)
☆202Jun 17, 2024Updated 2 years ago
simplify23 / MRN
View on GitHub
Official Pytorch implementations of MRN: Multiplexed Routing Network for Incremental Multilingual Text Recognition (ICCV 2023)
☆46Sep 26, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ZZR8066 / SEM
View on GitHub
☆19Mar 10, 2023Updated 3 years ago
onealwj / MVLT
View on GitHub
PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition
☆28Nov 11, 2022Updated 3 years ago
SCUT-DLVCLab / OCR-Reasoning
View on GitHub
[ICLR 2026] OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning
☆76May 26, 2026Updated last month
LARS-research / TREFE
View on GitHub
Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022
☆13Nov 25, 2022Updated 3 years ago
Levi-ZJY / SAN
View on GitHub
SAN: Structure-Aware Network for Complex and Long-tailed Chinese Text Recognition
☆10Apr 8, 2024Updated 2 years ago
MelosY / CAM
View on GitHub
☆27Feb 20, 2024Updated 2 years ago
sakura2233565548 / TabPedia
View on GitHub
This repository is the codebase of TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy
☆51Oct 16, 2024Updated last year
PriNing / ODM
View on GitHub
ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting
☆45Apr 11, 2025Updated last year
lcy0604 / QT-TextSR
View on GitHub
This repository is the implementation of "QT-TextSR: Enhancing scene text image super-resolution via efficient interaction with text reco…
☆20Jul 9, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
buaacxf / VIPTR
View on GitHub
☆44Jul 9, 2024Updated 2 years ago
computational-imaging / diffusion-in-the-dark
View on GitHub
Repository for Diffusion in the Dark (WACV 2024)
☆25Nov 6, 2023Updated 2 years ago
blackprotoss / CIRI
View on GitHub
Reproducing the Past: A Dataset for Benchmarking Inscription Restoration (ACM MM'24)
☆14Oct 15, 2025Updated 9 months ago
Caiyuan-Zheng / Consistency_Regularization_STR
View on GitHub
It's the code for the paper Pushing the Performance Limit of Scene Text Recognizer without Human Annotation, CVPR 2022.
☆28Jul 6, 2022Updated 4 years ago
VamosC / CLIP4STR
View on GitHub
An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".
☆152Nov 14, 2025Updated 8 months ago
wzx99 / CLIPOCR
View on GitHub
☆38Oct 20, 2023Updated 2 years ago
shannanyinxiang / SPTS
View on GitHub
Official implementation of SPTS: Single-Point Text Spotting (ACM MM 2022 Oral)
☆145Jul 26, 2023Updated 2 years ago
baudm / parseq
View on GitHub
Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)
☆727May 29, 2024Updated 2 years ago
markytools / strexp
View on GitHub
STRExp is a framework that provides Explainability (XAI) to Scene Text Recognition (STR) models.
☆11Nov 27, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Yuliang-Liu / SPTSv2
View on GitHub
☆22May 30, 2023Updated 3 years ago
ecnuljzhang / brush-your-text
View on GitHub
☆99Jan 3, 2024Updated 2 years ago
CyrilSterling / LPV
View on GitHub
The official code of Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition (IJCAI2023)
☆26Sep 3, 2023Updated 2 years ago
HCIILAB / Scene-Text-Recognition-Recommendations
View on GitHub
Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining
☆353Nov 29, 2023Updated 2 years ago
lqzxt / NGTR
View on GitHub
☆14May 26, 2025Updated last year
amazon-science / textadain-robust-recognition
View on GitHub
TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers
☆21Jul 26, 2022Updated 3 years ago
shannanyinxiang / PageNet
View on GitHub
Official implementation of PageNet (IJCV 2022)
☆82Oct 31, 2022Updated 3 years ago