yeungchenwa / HDR
[AAAI2025 Oral] Predicting the Original Appearance of Damaged Historical Documents
☆58Updated 2 months ago
Alternatives and similar repositories for HDR:
Users that are interested in HDR are comparing it to the libraries listed below
- VimTS: A Unified Video and Image Text Spotter☆76Updated 3 months ago
- The official code for NeurIPS 2024 paper: Harmonizing Visual Text Comprehension and Generation☆111Updated 3 months ago
- GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models☆60Updated 7 months ago
- ☆92Updated last year
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆139Updated 8 months ago
- JoyType: A Robust Design for Multilingual Visual Text Creation☆30Updated 3 months ago
- UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models☆217Updated last week
- The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”, TMM, 2024.☆31Updated 5 months ago
- Official implementation of ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining (AAAI 20…☆46Updated 7 months ago
- [2024-NeurIPS] TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control☆58Updated last month
- Official implementation of High Fidelity Scene Text Synthesis.☆46Updated last month
- Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)☆46Updated 8 months ago
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models☆58Updated 3 months ago
- Official code implementation of Slow Perception:Let's Perceive Geometric Figures Step-by-step☆108Updated this week
- Handwritten Text Recognition and Character Detection☆130Updated 3 months ago
- NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement☆37Updated 6 months ago
- Dreambooth (LoRA) with well-organized code structure. Naive adaptation from 🤗Diffusers.☆14Updated last year
- [ACM MM 2022] Marior: Margin Removal and Iterative Content Rectification for Document Dewarping in the Wild☆18Updated 2 years ago
- ☆77Updated last month
- [NAACL 2024] Visually Guided Generative Text-Layout Pre-training for Document Intelligence☆138Updated 5 months ago
- Vary-tiny codebase upon LAVIS (for training from scratch)and a PDF image-text pairs data (about 600k including English/Chinese)☆78Updated 5 months ago
- Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)☆122Updated last year
- A paper collection of recent diffusion models for text-image generation tasks, e,g., visual text generation, font generation, text remova…☆228Updated 2 months ago
- [TAI 2023] Appearance Enhancement for Camera-captured Document Images in the Wild☆32Updated last year
- Official code for DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degra…☆25Updated 3 months ago
- ☆165Updated 11 months ago
- (CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.☆58Updated 8 months ago
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)☆131Updated 3 weeks ago
- Text-To-Image Generation with Chinese Characters☆127Updated last year