99Franklin / DiffTextLinks
☆13Updated 6 months ago
Alternatives and similar repositories for DiffText
Users that are interested in DiffText are comparing it to the libraries listed below
Sorting:
- Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`☆17Updated last year
- [arXiv: 2505.12307] LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?☆23Updated last month
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Updated 2 years ago
- ☆25Updated last year
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆81Updated 2 years ago
- BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild☆30Updated last year
- [ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation☆11Updated 2 years ago
- [MM2023] An official implement of the paper "One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer"☆15Updated last year
- ECCV2024_Parrot Captions Teach CLIP to Spot Text☆66Updated 10 months ago
- ☆38Updated last year
- Official code for the paper: "Perception and Semantic Aware Regularization for Sequential Confidence Calibration (CVPR2023)"☆10Updated last year
- ☆14Updated 2 years ago
- The official codes and datasets for Artistic Text Segmentation (ECCV 2024).☆25Updated 9 months ago
- [AAAI 2024] SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression☆66Updated 4 months ago
- [CVPR2025] Official implementation of High Fidelity Scene Text Synthesis.☆67Updated 3 months ago
- The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer☆53Updated last year
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding☆17Updated 3 months ago
- The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”, TMM, 2024.☆40Updated 10 months ago
- [CVPR 2023 highlight] Towards Flexible Multi-modal Document Models☆57Updated last year
- Official implementation of ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining (AAAI 20…☆54Updated last year
- The official repo for the technical report "Scalable Mask Annotation for Video Text Spotting"☆17Updated 2 years ago
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers☆21Updated 2 years ago
- Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)☆56Updated last year
- Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023.☆22Updated last year
- (CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.☆66Updated last year
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆26Updated 2 years ago
- ☆81Updated 4 months ago
- [PR 2025] The official GitHub page of "MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Ca…☆59Updated this week
- [NeurIPS'24] GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching☆26Updated last month
- ☆99Updated last year