xxyQwQ / metascriptLinks
Project of AI3604 Computer Vision, 2023 Fall, SJTU
☆20Updated 10 months ago
Alternatives and similar repositories for metascript
Users that are interested in metascript are comparing it to the libraries listed below
Sorting:
- Text Image Inpainting via Global Structure-Guided Diffusion Models (Accepted by AAAI-24)☆73Updated 3 months ago
- [PR 2025] The official GitHub page of "MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Ca…☆60Updated 2 weeks ago
- [ACL 2025 main] The official GitHub page of "Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document Restorati…☆38Updated last week
- [arXiv: 2505.12307] LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?☆24Updated 2 months ago
- [CVPR2025] Official implementation of High Fidelity Scene Text Synthesis.☆67Updated 4 months ago
- ☆99Updated last year
- The official code implementation of "LaCon: Late-Constraint Diffusion for Steerable Guided Image Synthesis".☆34Updated 3 weeks ago
- ☆40Updated 6 months ago
- [NeurIPS'24] GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching☆26Updated 2 months ago
- This repository is the code of our paper "DiffUTE: Universal Text Editing Diffusion Model" (NeurIPS'2023).☆138Updated 3 months ago
- NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement☆45Updated 11 months ago
- The official code for NeurIPS 2024 paper: Harmonizing Visual Text Comprehension and Generation☆128Updated 8 months ago
- Official repository for CF-Font: Content Fusion for Few-shot Font Generation.☆132Updated 2 years ago
- Q-Insight is open-sourced at https://github.com/bytedance/Q-Insight. This repository will not receive further updates.☆143Updated 2 months ago
- Code Implementation of "Uni-paint: A Unified Framework for Multimodal Image Inpainting with Pretrained Diffusion Model"☆123Updated 4 months ago
- The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.☆41Updated 10 months ago
- The official project of paper "Visual Text Processing: A Comprehensive Review and Unified Evaluation""☆72Updated last month
- A paper collection of recent diffusion models for text-image generation tasks, e,g., visual text generation, font generation, text remova…☆259Updated 7 months ago
- [arXiv 25] Aesthetics is Cheap, Show me the Text: An Empirical Evaluation of State-of-the-Art Generative Models for OCR☆199Updated last week
- HINT: High-quality INpainting Transformer with Enhanced Attention and Mask-aware Encoding☆45Updated 6 months ago
- [AAAI2024] Official PyTorch implementation of VQ-Font: Few-Shot Font Generation with Structure-Aware Enhancement and Quantization.☆49Updated last year
- [2024-NeurIPS] TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control☆87Updated 4 months ago
- Official code for K-LoRA (CVPR 2025)☆118Updated last month
- [AAAI 2025] 🎬RCDMs🎬: Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models. RCDMs improve story…☆34Updated last month
- PyTorch implementation for the paper Don't Look into the Dark: Latent Codes for Pluralistic Image Inpainting (CVPR2024).☆34Updated last year
- [NeurIPS 2023] Official Code for CycleNet: Rethinking Cycle Consistent in Text‑Guided Diffusion for Image Manipulation☆91Updated last year
- [AAAI 2023] CoordFill: Efficient High-Resolution Image Inpainting via Parameterized Coordinate Querying☆92Updated last year
- Official code implementation of " TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image " in Pattern Recognition☆24Updated last year
- CAR: Controllable AutoRegressive Modeling for Visual Generation☆121Updated 8 months ago
- ☆96Updated 11 months ago