xxyQwQ / metascript
Project of AI3604 Computer Vision, 2023 Fall, SJTU
☆24Updated 2 months ago
Alternatives and similar repositories for metascript
Users who are interested in metascript are comparing it to the repositories listed below.
- Text Image Inpainting via Global Structure-Guided Diffusion Models (Accepted by AAAI-24)☆74Updated 7 months ago
- The official project of the paper "Visual Text Processing: A Comprehensive Review and Unified Evaluation"☆86Updated last month
- [PR 2025] The official GitHub page of "MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Ca…☆67Updated 4 months ago
- The official code implementation of "LaCon: Late-Constraint Diffusion for Steerable Guided Image Synthesis".☆36Updated 4 months ago
- [NeurIPS'24] GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching☆27Updated 5 months ago
- ☆99Updated last year
- Dreambooth (LoRA) with well-organized code structure. Naive adaptation from 🤗Diffusers.☆15Updated 2 years ago
- PEAN: A Diffusion-Based Prior-Enhanced Attention Network for Scene Text Image Super-Resolution (ACMMM 2024)☆51Updated 11 months ago
- This repository is the code of our paper "DiffUTE: Universal Text Editing Diffusion Model" (NeurIPS'2023).☆143Updated 7 months ago
- [ACL 2025 main] The official GitHub page of "Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document Restorati…☆48Updated last month
- NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement☆49Updated last year
- [AAAI2025] Official Implementation of "FOCUS: Towards Universal Foreground Segmentation"☆55Updated 4 months ago
- [CVPR2025] Official implementation of High Fidelity Scene Text Synthesis.☆77Updated 7 months ago
- Official repository for CF-Font: Content Fusion for Few-shot Font Generation.☆137Updated 2 years ago
- Official code implementation of "TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image" in Pattern Recognition☆24Updated last year
- [AAAI2024] Official PyTorch implementation of VQ-Font: Few-Shot Font Generation with Structure-Aware Enhancement and Quantization.☆54Updated last year
- Code Implementation of "Uni-paint: A Unified Framework for Multimodal Image Inpainting with Pretrained Diffusion Model"☆127Updated 8 months ago
- A simple and flexible PyTorch implementation of StableDiffusion based on diffusers.☆24Updated last year
- ☆12Updated last year
- [2024-NeurIPS] TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control☆96Updated 8 months ago
- Official implementation of "StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing" (CVMJ2024)☆78Updated last year
- ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting☆44Updated 7 months ago
- Official implementation of ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining (AAAI 20…☆61Updated last year
- Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)☆64Updated last year
- [ECCV 2024] SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation☆43Updated 8 months ago
- [arXiv: 2505.12307] LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?☆34Updated 6 months ago
- Official code for our CVPR 2025 paper: "Toward Generalized Image Quality Assessment: Relaxing the Perfect Reference Quality Assumption"☆59Updated 2 months ago
- ☆43Updated 6 months ago