salesforce / LayoutDETR
The official PyTorch implementation for arXiv'23 paper 'LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer'
☆68Updated 11 months ago
Related projects: ⓘ
- Towards Flexible Multi-modal Document Models [Inoue+, CVPR2023]☆55Updated last year
- OpenCOLE: Towards Reproducible Automatic Graphic Design Generation [Inoue+, CVPRW2024 (GDUG)]☆41Updated last week
- Official repository for "PosterLayout: A New Benchmark and Approach for Content-aware Visual-Textual Presentation Layout" (CVPR 2023).☆115Updated 2 months ago
- ☆119Updated 7 months ago
- ☆32Updated 8 months ago
- Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024)☆73Updated last month
- ☆78Updated 8 months ago
- Continuous diffusion for layout generation☆26Updated 5 months ago
- Cheng-Fu Yang*, Wan-Cyuan Fan*, Fu-En Yang, Yu-Chiang Frank Wang, "LayoutTransformer: Scene Layout Generation with Conceptual and Spatial…☆58Updated 2 years ago
- Implementation of CanvasVAE: Learning to Generate Vector Graphic Documents, ICCV 2021☆61Updated last year
- ☆89Updated 4 months ago
- Official code for paper: Desigen: A Pipeline for Controllable Design Template Generation [CVPR'24]☆54Updated 2 months ago
- [NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT☆129Updated 4 months ago
- ECCV2024_Parrot Captions Teach CLIP to Spot Text☆58Updated last week
- ☆37Updated 3 weeks ago
- ☆34Updated last month
- Official Repo of Graphist☆92Updated 4 months ago
- [CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation☆56Updated 2 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆75Updated 2 months ago
- ☆58Updated 10 months ago
- VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models☆93Updated last month
- [ICCV 2023 Oral, Best Paper Finalist] ITI-GEN: Inclusive Text-to-Image Generation☆62Updated 7 months ago
- Code for "DreamEdit: Subject-driven Image Editing" (TMLR2023)☆105Updated 7 months ago
- ☆128Updated 8 months ago
- [NeurIPS2023] This is the official code of the paper "GlyphControl: Glyph Conditional Control for Visual Text Generation"☆201Updated 2 months ago
- Official code repo for "Editing Implicit Assumptions in Text-to-Image Diffusion Models"☆81Updated last year
- ReCo: Region-Controlled Text-to-Image Generation, CVPR 2023☆114Updated 10 months ago
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆72Updated last year
- Implementation of InstructEdit☆66Updated 10 months ago