yeungchenwa / HDR
[AAAI2025] Predicting the Original Appearance of Damaged Historical Documents
☆58Updated last month
Alternatives and similar repositories for HDR:
Users that are interested in HDR are comparing it to the libraries listed below
- GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models☆57Updated 6 months ago
- The official code for NeurIPS 2024 paper: Harmonizing Visual Text Comprehension and Generation☆110Updated 2 months ago
- VimTS: A Unified Video and Image Text Spotter☆74Updated 2 months ago
- Official code implementation of Slow Perception:Let's Perceive Geometric Figures Step-by-step☆70Updated last week
- ☆90Updated last year
- JoyType: A Robust Design for Multilingual Visual Text Creation☆27Updated last month
- UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models☆216Updated 6 months ago
- [2024-NeurIPS] TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control☆47Updated this week
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models☆56Updated 2 months ago
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆135Updated 7 months ago
- Official implementation of ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining (AAAI 20…☆44Updated 6 months ago
- NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement☆37Updated 5 months ago
- Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)☆43Updated 7 months ago
- The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”, TMM, 2024.☆30Updated 4 months ago
- Official implementation of High Fidelity Scene Text Synthesis.☆45Updated 2 weeks ago
- Vary-tiny codebase upon LAVIS (for training from scratch)and a PDF image-text pairs data (about 600k including English/Chinese)☆76Updated 3 months ago
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)☆129Updated 7 months ago
- Handwritten Text Recognition and Character Detection☆120Updated 2 months ago
- ☆74Updated 3 weeks ago
- [TAI 2023] Appearance Enhancement for Camera-captured Document Images in the Wild☆30Updated last year
- Text-To-Image Generation with Chinese Characters☆125Updated last year
- A paper collection of recent diffusion models for text-image generation tasks, e,g., visual text generation, font generation, text remova…☆221Updated 3 weeks ago
- [NAACL 2024] Visually Guided Generative Text-Layout Pre-training for Document Intelligence☆134Updated 4 months ago
- (CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.☆57Updated 7 months ago
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆178Updated this week
- ☆162Updated last month
- [ECCV 2024] Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models☆67Updated 2 months ago
- Dreambooth (LoRA) with well-organized code structure. Naive adaptation from 🤗Diffusers.☆13Updated last year
- Official code for DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degra…☆22Updated last month
- Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)☆121Updated last year