CyberAgentAILab / flex-dmLinks
[CVPR 2023 highlight] Towards Flexible Multi-modal Document Models
☆59Updated 2 years ago
Alternatives and similar repositories for flex-dm
Users that are interested in flex-dm are comparing it to the libraries listed below
Sorting:
- Implementation of CanvasVAE: Learning to Generate Vector Graphic Documents, ICCV 2021☆69Updated 2 years ago
- The official PyTorch implementation for arXiv'23 paper 'LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer'☆100Updated 4 months ago
- ☆81Updated 2 years ago
- Cheng-Fu Yang*, Wan-Cyuan Fan*, Fu-En Yang, Yu-Chiang Frank Wang, "LayoutTransformer: Scene Layout Generation with Conceptual and Spatial…☆63Updated 3 years ago
- Continuous diffusion for layout generation☆47Updated 7 months ago
- OpenCOLE: Towards Reproducible Automatic Graphic Design Generation [Inoue+, CVPRW2024 (GDUG)]☆78Updated 6 months ago
- [NeurIPS2023] This is the official code of the paper "GlyphControl: Glyph Conditional Control for Visual Text Generation"☆235Updated last year
- Official code for paper: Desigen: A Pipeline for Controllable Design Template Generation [CVPR'24]☆71Updated last year
- [CVPR 2024 Oral] Official repository for RALF: Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation☆134Updated last year
- Source code of the TextLap model, a LLM for text-2-layout generation.☆15Updated 10 months ago
- Code for "DreamEdit: Subject-driven Image Editing" (TMLR2023)☆108Updated last year
- ☆97Updated last year
- Layout Generation and Baseline implementations☆158Updated 3 years ago
- ☆99Updated last year
- FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions☆55Updated last year
- ☆21Updated last year
- Official repository for "PosterLayout: A New Benchmark and Approach for Content-aware Visual-Textual Presentation Layout" (CVPR 2023).☆142Updated 5 months ago
- ☆27Updated 4 years ago
- PyTorch implementation of "LayoutTransformer: Layout Generation and Completion with Self-attention" to appear in ICCV 2021☆163Updated 3 years ago
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆81Updated 2 years ago
- Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023.☆22Updated last year
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Structured Diffusion Guidance for Compositional Text…☆120Updated 2 years ago
- BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild☆31Updated last year
- Official Repo of Graphist☆125Updated last year
- Official implementation of the MM'21 paper "Constrained Graphic Layout Generation via Latent Optimization" (LayoutGAN++, CLG-LO, and Layo…☆140Updated 2 years ago
- Implementation of a light-weighted Latent-Composer in PyTorch based on "Composer: Creative and Controllable Image Synthesis with Composab…☆39Updated 2 years ago
- Text-To-Image Generation with Chinese Characters☆131Updated 2 years ago
- LayoutFlow: Flow Matching for Layout Generation [Andrade Guerreiro et al., ECCV 2024]☆33Updated 4 months ago
- Code for ACM MM'23 paper: LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation☆49Updated last year
- ☆92Updated 2 years ago