yeungchenwa / HDR
[AAAI2025 Oral] Predicting the Original Appearance of Damaged Historical Documents
☆72 Updated last month
Alternatives and similar repositories for HDR:
Users who are interested in HDR are comparing it to the repositories listed below:
- Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024) ☆51 Updated 10 months ago
- GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models ☆71 Updated 9 months ago
- [TAI 2023] Appearance Enhancement for Camera-captured Document Images in the Wild ☆36 Updated last year
- The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”, TMM, 2024. ☆35 Updated 7 months ago
- ☆27 Updated 2 months ago
- The official code for NeurIPS 2024 paper: Harmonizing Visual Text Comprehension and Generation ☆119 Updated 5 months ago
- VimTS: A Unified Video and Image Text Spotter ☆77 Updated 5 months ago
- Official code for DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degra… ☆28 Updated 4 months ago
- [2024-NeurIPS] TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control ☆73 Updated last month
- Evaluating GPT-4o's image generation and editing ability in OCR tasks. ☆41 Updated 2 weeks ago
- Official implementation of ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining (AAAI 20… ☆51 Updated 9 months ago
- NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement ☆44 Updated 8 months ago
- A Token-level Text Image Foundation Model for Document Understanding ☆89 Updated 3 weeks ago
- A paper collection of recent diffusion models for text-image generation tasks, e.g., visual text generation, font generation, text remova… ☆243 Updated 4 months ago
- The official project of paper "Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing" ☆60 Updated 3 months ago
- ACM Multimedia 2023: DocDiff: Document Enhancement via Residual Diffusion Models. Also contains 1597 red seals in Chinese scenes, along w… ☆268 Updated 7 months ago
- (CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting. ☆60 Updated 10 months ago
- UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models ☆224 Updated 2 months ago
- ☆84 Updated 3 months ago
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models ☆34 Updated 3 weeks ago
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition ☆28 Updated last year
- [NAACL 2024] Visually Guided Generative Text-Layout Pre-training for Document Intelligence ☆143 Updated 7 months ago
- [AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Lear… ☆371 Updated last year
- ☆95 Updated last year
- [EMNLP 2024] TongGu, a classical Chinese language model. ☆35 Updated 6 months ago
- [CVPR2025] Official implementation of High Fidelity Scene Text Synthesis. ☆60 Updated 3 weeks ago
- ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting ☆34 Updated last week
- [ECCV2024] PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer ☆72 Updated last week
- This repository is the implementation of "Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Contex…