shannanyinxiang/ViTEraser

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/shannanyinxiang/ViTEraser)

shannanyinxiang / ViTEraser

Official implementation of ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining (AAAI 2024)

☆66

Alternatives and similar repositories for ViTEraser

Users that are interested in ViTEraser are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

shannanyinxiang / UPOCR
View on GitHub
Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)
☆69Jun 6, 2024Updated 2 years ago
GuangtaoLyu / FETNet
View on GitHub
FETNet: Feature Erasing and Transferring Network for Scene Text Removal
☆35Jul 18, 2023Updated 3 years ago
lcy0604 / CTRNet
View on GitHub
This repository is the implementation of "Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Contex…
☆97Feb 21, 2023Updated 3 years ago
wzx99 / TMIM
View on GitHub
☆13Oct 17, 2024Updated last year
qqqyd / MOSTEL
View on GitHub
☆60Jul 25, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
HCIILAB / M5HisDoc
View on GitHub
☆34Dec 18, 2025Updated 7 months ago
Canjie-Luo / Real-300K
View on GitHub
The dataset used in the CVPR 2022 paper (SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Norm…
☆34Jun 21, 2022Updated 4 years ago
lcy0604 / QT-TextSR
View on GitHub
This repository is the implementation of "QT-TextSR: Enhancing scene text image super-resolution via efficient interaction with text reco…
☆20Jul 9, 2025Updated last year
wangyuxin87 / Tampered-IC13
View on GitHub
-
☆24Oct 25, 2022Updated 3 years ago
HCIILAB / LAST
View on GitHub
Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition
☆28Aug 29, 2023Updated 2 years ago
GuangtaoLyu / PSSTRNet
View on GitHub
☆13Jul 28, 2024Updated last year
SCUT-DLVCLab / RFUND
View on GitHub
[MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…
☆21Dec 4, 2024Updated last year
shannanyinxiang / SPTS
View on GitHub
Official implementation of SPTS: Single-Point Text Spotting (ACM MM 2022 Oral)
☆145Jul 26, 2023Updated 2 years ago
mxin262 / ESTextSpotter
View on GitHub
(ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
☆78Apr 9, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
shi-yx / URaG
View on GitHub
Official implementation of URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding (AAAI 2026…
☆43Feb 4, 2026Updated 5 months ago
fh2019ustc / DeepEraser
View on GitHub
The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”, TMM, 2024.
☆53Aug 26, 2024Updated last year
wangyuxin87 / PERT
View on GitHub
PERT: A Progressively Region-based Network for Scene Text Removal (TIP2023)
☆37Aug 11, 2023Updated 2 years ago
HCIILAB / SCUT-EnsText
View on GitHub
☆69Apr 18, 2024Updated 2 years ago
Planet-AI-GmbH / tfaip-hybrid-ctc-s2s
View on GitHub
Repository sharing code and the model for the paper "Rescoring Sequence-to-Sequence Models for Text Line Recognition with CTC-Prefixes"
☆17Oct 13, 2021Updated 4 years ago
yeungchenwa / Recommendations-Diffusion-Text-Image
View on GitHub
A paper collection of recent diffusion models for text-image generation tasks, e,g., visual text generation, font generation, text remova…
☆273Dec 19, 2024Updated last year
CandleLabAI / TPFNet
View on GitHub
☆11Dec 26, 2022Updated 3 years ago
kyxscut / CG-GAN
View on GitHub
Official PyTorch implementation of the CVPR 2022 paper: "Look Closer to Supervise Better: One-Shot Font Generation via Component-Based Di…
☆94Sep 17, 2022Updated 3 years ago
lcy0604 / CTRNet-plus
View on GitHub
The official implement of CTRNet++.
☆15Dec 30, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Caiyuan-Zheng / Consistency_Regularization_STR
View on GitHub
It's the code for the paper Pushing the Performance Limit of Scene Text Recognizer without Human Annotation, CVPR 2022.
☆28Jul 6, 2022Updated 4 years ago
aiwolflow666 / SceneTextRemoval
View on GitHub
Scene text removal via cascaded text stroke detection and erasing
☆35Feb 25, 2023Updated 3 years ago
SCUT-DLVCLab / SCUT-EnsExam
View on GitHub
SCUT-EnsExam is a real-world handwritten text erasure dataset for examination paper scenarios, which consists of 545 examination paper im…
☆21Jul 17, 2026Updated last week
shuyansy / Visual-Text-Processing-survey
View on GitHub
The official project of paper "Visual Text Processing: A Comprehensive Review and Unified Evaluation""
☆103Oct 20, 2025Updated 9 months ago
shannanyinxiang / PageNet
View on GitHub
Official implementation of PageNet (IJCV 2022)
☆82Oct 31, 2022Updated 3 years ago
TencentARC / BTS
View on GitHub
BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild
☆33Apr 16, 2024Updated 2 years ago
SCUT-DLVCLab / OCR-Reasoning
View on GitHub
[ICLR 2026] OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning
☆76May 26, 2026Updated 2 months ago
lcy0604 / EraseNet
View on GitHub
☆157Jul 7, 2022Updated 4 years ago
amazon-science / textadain-robust-recognition
View on GitHub
TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers
☆21Jul 26, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
HCIILAB / M6Doc
View on GitHub
☆164May 8, 2025Updated last year
ZZZHANG-jx / Marior
View on GitHub
[ACM MM 2022] Marior: Margin Removal and Iterative Content Rectification for Document Dewarping in the Wild
☆26Aug 12, 2022Updated 3 years ago
HCIILAB / Scene-Text-End2end
View on GitHub
☆153Mar 27, 2020Updated 6 years ago
Mountchicken / Structured_Dreambooth_LoRA
View on GitHub
Dreambooth (LoRA) with well-organized code structure. Naive adaptation from 🤗Diffusers.
☆18May 18, 2023Updated 3 years ago
wangyuxin87 / Tampered_sroie
View on GitHub
The tampered text detection dataset
☆22Aug 23, 2023Updated 2 years ago
SCUT-DLVCLab / GPT-4V_OCR
View on GitHub
Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)
☆128Nov 13, 2023Updated 2 years ago
youdao-ai / SRNet-Datagen
View on GitHub
This is a data generator of SRNet which is the model of paper Editing Text in the wild.
☆116Jan 19, 2023Updated 3 years ago