davidhalladay / LayoutTransformer
Cheng-Fu Yang*, Wan-Cyuan Fan*, Fu-En Yang, Yu-Chiang Frank Wang, "LayoutTransformer: Scene Layout Generation with Conceptual and Spatial Diversity", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021.
☆58Updated 2 years ago
Alternatives and similar repositories for LayoutTransformer:
Users that are interested in LayoutTransformer are comparing it to the libraries listed below
- Towards Flexible Multi-modal Document Models [Inoue+, CVPR2023]☆56Updated last year
- Implementation of CanvasVAE: Learning to Generate Vector Graphic Documents, ICCV 2021☆63Updated last year
- A list of papers and other resources on language-guided image editing.☆38Updated 4 years ago
- Context-Aware Layout to Image Generation with Enhanced Object Appearance☆47Updated last year
- ☆50Updated 2 years ago
- (wip) Use LAION-AI's CLIP "conditoned prior" to generate CLIP image embeds from CLIP text embeds.☆27Updated 2 years ago
- Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-L…☆37Updated 2 years ago
- One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations. NeurIPS2022.☆34Updated last year
- Code for "Learning Canonical Representations for Scene Graph to Image Generation", Herzig & Bar et al., ECCV2020☆30Updated 2 years ago
- [ECCV2022] New benchmark for evaluating pre-trained model; New supervised contrastive learning framework.☆107Updated last year
- The official PyTorch implementation for arXiv'23 paper 'LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer'☆84Updated last year
- PyTorch implementation of "LayoutTransformer: Layout Generation and Completion with Self-attention" to appear in ICCV 2021☆154Updated 2 years ago
- Continuous diffusion for layout generation☆37Updated 9 months ago
- Official implementation for "Diffusion-Based Scene Graph to Image Generation with Masked Contrastive Pre-Training" https://arxiv.org/abs/…☆64Updated 3 weeks ago
- Layout Generation and Baseline implementations☆148Updated 2 years ago
- ☆97Updated 7 months ago
- [Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training☆21Updated 2 years ago
- [AAAI 2024] ConceptBed Evaluations for Personalized Text-to-Image Diffusion Models☆24Updated last year
- Official repository for the General Robust Image Task (GRIT) Benchmark☆50Updated last year
- ImaginaryNet: Learning Object Detectors without Real Images and Annotations☆26Updated last year
- Code and Models for "GeneCIS A Benchmark for General Conditional Image Similarity"☆55Updated last year
- Research code for "Training Vision-Language Transformers from Captions Alone"☆33Updated 2 years ago
- Code release for LayoutDiffuse☆52Updated last year
- Simple script to compute CLIP-based scores given a DALL-e trained model.☆30Updated 3 years ago
- The 1st place solution of 2022 Ego4d Natural Language Queries.☆32Updated 2 years ago
- Implementation of Sketch-BERT: Learning Sketch Bidirectional Encoder Representation from Transformers by Self-supervised Learning of Sket…☆29Updated 2 years ago
- An pytorch implementation of our NeurIPS paper of PasteGAN: A Semi-Parametric Method to Generate Image from Scene Graph☆53Updated 2 years ago
- Release of ImageNet-Captions☆45Updated last year
- [ECCV2022] Contrastive Vision-Language Pre-training with Limited Resources☆45Updated 2 years ago