CyberAgentAILab / flex-dm
[CVPR 2023 highlight] Towards Flexible Multi-modal Document Models
☆57 · Updated last year
Alternatives and similar repositories for flex-dm
Users who are interested in flex-dm are comparing it to the repositories listed below.
- The official PyTorch implementation for arXiv'23 paper 'LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer' ☆94 · Updated 3 weeks ago
- Implementation of CanvasVAE: Learning to Generate Vector Graphic Documents, ICCV 2021 ☆65 · Updated 2 years ago
- Cheng-Fu Yang*, Wan-Cyuan Fan*, Fu-En Yang, Yu-Chiang Frank Wang, "LayoutTransformer: Scene Layout Generation with Conceptual and Spatial… ☆59 · Updated 3 years ago
- Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023. ☆22 · Updated last year
- Continuous diffusion for layout generation ☆43 · Updated 3 months ago
- OpenCOLE: Towards Reproducible Automatic Graphic Design Generation [Inoue+, CVPRW2024 (GDUG)] ☆70 · Updated 2 months ago
- ☆80 · Updated 2 years ago
- Official code for paper: Desigen: A Pipeline for Controllable Design Template Generation [CVPR'24] ☆70 · Updated 10 months ago
- ☆38 · Updated last year
- Code for ACM MM'23 paper: LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation ☆48 · Updated 10 months ago
- Diffusion Layout Transformer implementation. ☆59 · Updated last year
- Official repository for "PosterLayout: A New Benchmark and Approach for Content-aware Visual-Textual Presentation Layout" (CVPR 2023). ☆137 · Updated 2 months ago
- ☆27 · Updated 4 years ago
- ☆95 · Updated last year
- Source code of the TextLap model, a LLM for text-2-layout generation. ☆15 · Updated 7 months ago
- BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild ☆30 · Updated last year
- Implementation of a light-weighted Latent-Composer in PyTorch based on "Composer: Creative and Controllable Image Synthesis with Composab… ☆39 · Updated 2 years ago
- PyTorch implementation of "LayoutTransformer: Layout Generation and Completion with Self-attention" to appear in ICCV 2021 ☆160 · Updated 3 years ago
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept… ☆81 · Updated 2 years ago
- ☆13 · Updated 4 months ago
- LayoutFlow: Flow Matching for Layout Generation [Andrade Guerreiro et al., ECCV 2024] ☆32 · Updated 3 weeks ago
- [CVPR2025] Official implementation of High Fidelity Scene Text Synthesis. ☆63 · Updated 2 months ago
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance". ☆42 · Updated 2 years ago
- ☆20 · Updated 10 months ago
- ☆65 · Updated last year
- Code for "DreamEdit: Subject-driven Image Editing" (TMLR2023) ☆108 · Updated last year
- ☆93 · Updated 10 months ago
- Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024) ☆85 · Updated 4 months ago
- This repository is the code of our paper "DiffUTE: Universal Text Editing Diffusion Model" (NeurIPS'2023). ☆132 · Updated last month
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Structured Diffusion Guidance for Compositional Text… ☆120 · Updated 2 years ago