CyberAgentAILab / flex-dm
[CVPR 2023 highlight] Towards Flexible Multi-modal Document Models
☆56 Updated last year
Alternatives and similar repositories for flex-dm
Users who are interested in flex-dm compare it to the libraries listed below.
- Implementation of CanvasVAE: Learning to Generate Vector Graphic Documents, ICCV 2021 ☆65 Updated 2 years ago
- Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023. ☆22 Updated last year
- OpenCOLE: Towards Reproducible Automatic Graphic Design Generation [Inoue+, CVPRW2024 (GDUG)] ☆65 Updated 2 months ago
- The official PyTorch implementation for arXiv'23 paper 'LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer' ☆94 Updated 3 months ago
- Cheng-Fu Yang*, Wan-Cyuan Fan*, Fu-En Yang, Yu-Chiang Frank Wang, "LayoutTransformer: Scene Layout Generation with Conceptual and Spatial… ☆59 Updated 3 years ago
- Continuous diffusion for layout generation ☆42 Updated 2 months ago
- Source code of the TextLap model, an LLM for text-2-layout generation. ☆15 Updated 6 months ago
- Code for ACM MM'23 paper: LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation ☆48 Updated 9 months ago
- Official code for paper: Desigen: A Pipeline for Controllable Design Template Generation [CVPR'24] ☆69 Updated 9 months ago
- BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild ☆30 Updated last year
- ECCV2024_Parrot Captions Teach CLIP to Spot Text ☆66 Updated 8 months ago
- LayoutFlow: Flow Matching for Layout Generation [Andrade Guerreiro et al., ECCV 2024] ☆32 Updated 5 months ago
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept… ☆81 Updated 2 years ago
- [NeurIPS2023] This is the official code of the paper "GlyphControl: Glyph Conditional Control for Visual Text Generation" ☆231 Updated 10 months ago
- PyTorch implementation of "LayoutTransformer: Layout Generation and Completion with Self-attention" to appear in ICCV 2021 ☆159 Updated 3 years ago
- Implementation of a light-weighted Latent-Composer in PyTorch based on "Composer: Creative and Controllable Image Synthesis with Composab… ☆39 Updated 2 years ago
- Code for "DreamEdit: Subject-driven Image Editing" (TMLR2023) ☆107 Updated last year
- [CVPR 2024 Oral] Official repository for RALF: Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation ☆120 Updated 10 months ago
- Official repository for "PosterLayout: A New Benchmark and Approach for Content-aware Visual-Textual Presentation Layout" (CVPR 2023). ☆133 Updated last month
- [CVPR2025] Official implementation of High Fidelity Scene Text Synthesis. ☆62 Updated last month
- This repository is the code of our paper "DiffUTE: Universal Text Editing Diffusion Model" (NeurIPS'2023). ☆131 Updated last month