xxyQwQ / metascriptLinks

Project of AI3604 Computer Vision, 2023 Fall, SJTU

☆20

Alternatives and similar repositories for metascript

Users that are interested in metascript are comparing it to the libraries listed below

Sorting:

blackprotoss / GSDM
Text Image Inpainting via Global Structure-Guided Diffusion Models (Accepted by AAAI-24)
☆73Updated 3 months ago
SCUT-DLVCLab / MegaHan97K
[PR 2025] The official GitHub page of "MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Ca…
☆60Updated 2 weeks ago
SCUT-DLVCLab / AutoHDR
[ACL 2025 main] The official GitHub page of "Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document Restorati…
☆38Updated last week
MiliLab / LogicOCR
[arXiv: 2505.12307] LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?
☆24Updated 2 months ago
CodeGoat24 / DreamText
[CVPR2025] Official implementation of High Fidelity Scene Text Synthesis.
☆67Updated 4 months ago
ecnuljzhang / brush-your-text
☆99Updated last year
AlonzoLeeeooo / LCDG
The official code implementation of "LaCon: Late-Constraint Diffusion for Steerable Guided Image Synthesis".
☆34Updated 3 weeks ago
hithqd / DynamicControl
☆40Updated 6 months ago
Hxyz-123 / GoMatching
[NeurIPS'24] GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching
☆26Updated 2 months ago
chenhaoxing / DiffUTE
This repository is the code of our paper "DiffUTE: Universal Text Editing Diffusion Model" (NeurIPS'2023).
☆138Updated 3 months ago
ispamm / NAF-DPM
NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement
☆45Updated 11 months ago
bytedance / TextHarmony
The official code for NeurIPS 2024 paper: Harmonizing Visual Text Comprehension and Generation
☆128Updated 8 months ago
wangchi95 / CF-Font
Official repository for CF-Font: Content Fusion for Few-shot Font Generation.
☆132Updated 2 years ago
lwq20020127 / Q-Insight
Q-Insight is open-sourced at https://github.com/bytedance/Q-Insight. This repository will not receive further updates.
☆143Updated 2 months ago
ysy31415 / unipaint
Code Implementation of "Uni-paint: A Unified Framework for Multimodal Image Inpainting with Pretrained Diffusion Model"
☆123Updated 4 months ago
bzluan / TextCoT
The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.
☆41Updated 10 months ago
shuyansy / Visual-Text-Processing-survey
The official project of paper "Visual Text Processing: A Comprehensive Review and Unified Evaluation""
☆72Updated last month
yeungchenwa / Recommendations-Diffusion-Text-Image
A paper collection of recent diffusion models for text-image generation tasks, e,g., visual text generation, font generation, text remova…
☆259Updated 7 months ago
NiceRingNode / Awesome-Generative-Models-for-OCR
[arXiv 25] Aesthetics is Cheap, Show me the Text: An Empirical Evaluation of State-of-the-Art Generative Models for OCR
☆199Updated last week
ChrisChen1023 / HINT
HINT: High-quality INpainting Transformer with Enhanced Attention and Mask-aware Encoding
☆45Updated 6 months ago
Yaomingshuai / VQ-Font
[AAAI2024] Official PyTorch implementation of VQ-Font: Few-Shot Font Generation with Structure-Aware Enhancement and Quantization.
☆49Updated last year
weichaozeng / TextCtrl
[2024-NeurIPS] TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
☆87Updated 4 months ago
HVision-NKU / K-LoRA
Official code for K-LoRA (CVPR 2025)
☆118Updated last month
muzishen / RCDMs
[AAAI 2025] 🎬RCDMs🎬: Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models. RCDMs improve story…
☆34Updated last month
nintendops / latent-code-inpainting
PyTorch implementation for the paper Don't Look into the Dark: Latent Codes for Pluralistic Image Inpainting (CVPR2024).
☆34Updated last year
sled-group / CycleNet
[NeurIPS 2023] Official Code for CycleNet: Rethinking Cycle Consistent in Text‑Guided Diffusion for Image Manipulation
☆91Updated last year
NiFangBaAGe / CoordFill
[AAAI 2023] CoordFill: Efficient High-Resolution Image Inpainting via Parameterized Coordinate Querying
☆92Updated last year
Lenubolim / TextDiff
Official code implementation of " TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image " in Pattern Recognition
☆24Updated last year
MiracleDance / CAR
CAR: Controllable AutoRegressive Modeling for Visual Generation
☆121Updated 8 months ago
UCSB-NLP-Chang / DiffSTE
☆96Updated 11 months ago