CyberAgentAILab / flex-dm
[CVPR 2023 highlight] Towards Flexible Multi-modal Document Models
☆57Updated last year
Alternatives and similar repositories for flex-dm
Users who are interested in flex-dm are comparing it to the libraries listed below.
- Implementation of CanvasVAE: Learning to Generate Vector Graphic Documents, ICCV 2021☆66Updated 2 years ago
- OpenCOLE: Towards Reproducible Automatic Graphic Design Generation [Inoue+, CVPRW2024 (GDUG)]☆71Updated 3 months ago
- The official PyTorch implementation for arXiv'23 paper 'LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer'☆96Updated last month
- ☆80Updated 2 years ago
- Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023.☆22Updated last year
- Cheng-Fu Yang*, Wan-Cyuan Fan*, Fu-En Yang, Yu-Chiang Frank Wang, "LayoutTransformer: Scene Layout Generation with Conceptual and Spatial…☆59Updated 3 years ago
- Continuous diffusion for layout generation☆45Updated 4 months ago
- Official code for paper: Desigen: A Pipeline for Controllable Design Template Generation [CVPR'24]☆70Updated 11 months ago
- BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild☆30Updated last year
- ☆97Updated last year
- PyTorch implementation of "LayoutTransformer: Layout Generation and Completion with Self-attention" to appear in ICCV 2021☆160Updated 3 years ago
- ☆39Updated last year
- Source code of the TextLap model, an LLM for text-2-layout generation.☆15Updated 8 months ago
- LayoutFlow: Flow Matching for Layout Generation [Andrade Guerreiro et al., ECCV 2024]☆32Updated last month
- A curated list of papers and resources for text-to-image evaluation.☆29Updated last year
- ☆92Updated 10 months ago
- Code for "DreamEdit: Subject-driven Image Editing" (TMLR2023)☆108Updated last year
- [NeurIPS2023] This is the official code of the paper "GlyphControl: Glyph Conditional Control for Visual Text Generation"☆233Updated 11 months ago
- Official repository for "PosterLayout: A New Benchmark and Approach for Content-aware Visual-Textual Presentation Layout" (CVPR 2023).☆139Updated 2 months ago
- Official PyTorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024)☆84Updated 4 months ago
- Code for ACM MM'23 paper: LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation☆48Updated 10 months ago
- ☆27Updated 4 years ago
- A Diffusion training toolbox based on diffusers and existing SOTA methods, including Dreambooth, Textual Inversion, LoRA, Custom Diffusion…☆80Updated 8 months ago
- ☆20Updated 10 months ago
- ☆13Updated 5 months ago
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance".☆42Updated 2 years ago
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆81Updated 2 years ago
- FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions☆55Updated last year
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆80Updated last year
- Minimal Differentiable Image Reward Functions☆60Updated 2 months ago