CyberAgentAILab / webcolor
Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023.
☆22Updated last year
Alternatives and similar repositories for webcolor:
Users that are interested in webcolor are comparing it to the libraries listed below
- [ECCV2022] Mind the Gap in Distilling StyleGANs☆29Updated last year
- Towards Flexible Multi-modal Document Models [Inoue+, CVPR2023]☆56Updated last year
- Source code of the TextLap model, a LLM for text-2-layout generation.☆13Updated 2 months ago
- ☆21Updated last year
- A curated list of papers and resources for text-to-image evaluation.☆26Updated last year
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆33Updated 6 months ago
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆33Updated 10 months ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"☆61Updated 8 months ago
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance".☆41Updated last year
- Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"☆28Updated 8 months ago
- An interactive demo based on Segment-Anything for stroke-based painting which enables human-like painting.☆34Updated last year
- [TMM 2022] ISF-GAN.☆17Updated 3 weeks ago
- Masked Vision-Language Transformer in Fashion☆33Updated last year
- The official repo of continuous speculative decoding☆21Updated last month
- This project provides a data set with bounding boxes, body poses, 3D face meshes & captions of people from our LAION-2.2B. Additionally i…☆13Updated 3 years ago
- An official PyTorch implementation for CLIPPR☆29Updated last year
- OpenCOLE: Towards Reproducible Automatic Graphic Design Generation [Inoue+, CVPRW2024 (GDUG)]☆53Updated 3 weeks ago
- ☆35Updated 6 months ago
- FuseCap: Large Language Model for Visual Data Fusion in Enriched Caption Generation☆51Updated 9 months ago
- A Versatile Face Encoder for Zero-Shot Diffusion Model Personalization☆22Updated this week
- Fast Sprite Decomposition from Animated Graphics [ECCV2024]☆29Updated 3 months ago
- ImaginaryNet: Learning Object Detectors without Real Images and Annotations☆26Updated last year
- ☆16Updated 5 months ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆41Updated 5 months ago
- (wip) Use LAION-AI's CLIP "conditoned prior" to generate CLIP image embeds from CLIP text embeds.☆27Updated 2 years ago
- ☆17Updated 2 years ago
- Official pytorch implementation of I2I translation with low resolution conditioning☆23Updated 3 years ago
- DreamDance: Personalized Text-to-video Generation by Combining Text-to-Image Synthesis and Motion Transfer☆14Updated 2 years ago
- ☆19Updated last year