CyberAgentAILab / flex-dm
Towards Flexible Multi-modal Document Models [Inoue+, CVPR2023]
☆56Updated last year
Alternatives and similar repositories for flex-dm:
Users that are interested in flex-dm are comparing it to the libraries listed below
- Implementation of CanvasVAE: Learning to Generate Vector Graphic Documents, ICCV 2021☆63Updated last year
- The official PyTorch implementation for arXiv'23 paper 'LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer'☆87Updated 2 weeks ago
- OpenCOLE: Towards Reproducible Automatic Graphic Design Generation [Inoue+, CVPRW2024 (GDUG)]☆56Updated last month
- ☆79Updated 2 years ago
- Continuous diffusion for layout generation☆37Updated 9 months ago
- Cheng-Fu Yang*, Wan-Cyuan Fan*, Fu-En Yang, Yu-Chiang Frank Wang, "LayoutTransformer: Scene Layout Generation with Conceptual and Spatial…☆58Updated 2 years ago
- ☆90Updated 6 months ago
- Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023.☆22Updated last year
- ☆37Updated last year
- ☆91Updated last year
- [NeurIPS2023] This is the official code of the paper "GlyphControl: Glyph Conditional Control for Visual Text Generation"☆223Updated 7 months ago
- Source code of the TextLap model, a LLM for text-2-layout generation.☆13Updated 3 months ago
- BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild☆27Updated 10 months ago
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆76Updated 2 years ago
- Official repository for "PosterLayout: A New Benchmark and Approach for Content-aware Visual-Textual Presentation Layout" (CVPR 2023).☆127Updated 7 months ago
- ☆42Updated 6 months ago
- ☆87Updated last month
- PyTorch implementation of "LayoutTransformer: Layout Generation and Completion with Self-attention" to appear in ICCV 2021☆155Updated 3 years ago
- Diffusion Layout Transformer implementation.☆56Updated last year
- This repository is the code of our paper "DiffUTE: Universal Text Editing Diffusion Model" (NeurIPS'2023).☆127Updated 3 months ago
- Official code for paper: Desigen: A Pipeline for Controllable Design Template Generation [CVPR'24]☆65Updated 6 months ago
- ☆152Updated last month
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Structured Diffusion Guidance for Compositional Text…☆119Updated last year
- Official implementation of High Fidelity Scene Text Synthesis.☆46Updated last month
- Code for "DreamEdit: Subject-driven Image Editing" (TMLR2023)☆105Updated last year
- Text-To-Image Generation with Chinese Characters☆127Updated last year
- Code for ACM MM'23 paper: LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation☆44Updated 6 months ago
- ☆26Updated 3 years ago
- ECCV2024_Parrot Captions Teach CLIP to Spot Text☆63Updated 5 months ago
- FuseCap: Large Language Model for Visual Data Fusion in Enriched Caption Generation☆53Updated 9 months ago