tcl9876 / Diffusion-Handwriting-Generation
☆96Updated 3 years ago
Related projects: ⓘ
- Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023☆61Updated 2 months ago
- ☆71Updated 9 months ago
- ☆31Updated last year
- Code and data for ECCV 2020 paper Generating Handwriting via Decoupled Style Descriptors☆53Updated 2 years ago
- Diffusion-based markup-to-image generation☆78Updated last year
- code for CLIPDraw☆125Updated 2 years ago
- [NeurIPS2023] This is the official code of the paper "GlyphControl: Glyph Conditional Control for Visual Text Generation"☆201Updated 2 months ago
- Source code for ECCV20 "GANwriting: Content-Conditioned Generation of Styled Handwritten Word Images"☆67Updated 3 years ago
- ☆85Updated last month
- JAX implementation ViT-VQGAN☆77Updated last year
- Code for BMVC2020 paper "Text and Style Conditioned GAN for Generation of Offline Handwriting Lines"☆66Updated last year
- ☆104Updated last year
- Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021☆69Updated 3 years ago
- Handwriting-Transformers (ICCV21)☆172Updated 6 months ago
- Official repository accompaying the ICDAR 2023 paper☆10Updated 11 months ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆23Updated last year
- Towards Flexible Multi-modal Document Models [Inoue+, CVPR2023]☆55Updated last year
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch☆64Updated 2 years ago
- ☆18Updated 2 years ago
- Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR☆79Updated last year
- Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023☆43Updated 3 months ago
- [ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)☆36Updated 11 months ago
- This repo is the official implementation of DeepCalliFont: Few-shot Chinese Calligraphy Font Synthesis by Integrating Dual-modality Gener…☆10Updated 4 months ago
- ☆152Updated 2 years ago
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆72Updated last year
- The official PyTorch implementation for arXiv'23 paper 'LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer'☆68Updated 11 months ago
- ☆64Updated 11 months ago
- Simple large-scale training of stable diffusion with multi-node support.☆122Updated last year
- Official PyTorch implementation of the CVPR 2022 paper: "Look Closer to Supervise Better: One-Shot Font Generation via Component-Based Di…☆88Updated 2 years ago
- Optimized library for large-scale extraction of frames and audio from video.☆202Updated last year