LAION-AI / LAION-PEOPLE
This project provides a data set with bounding boxes, body poses, 3D face meshes & captions of people from our LAION-2.2B. Additionally it provides clusters based on the poses and face meshes and pose-related captions based on these cluster assignments.
☆13Updated 2 years ago
Related projects: ⓘ
- Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023.☆21Updated 9 months ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆17Updated 3 years ago
- Pytorch implementation of StyleGAN2 in my style☆11Updated last year
- ☆17Updated last year
- [ECCV2022] Mind the Gap in Distilling StyleGANs☆28Updated last year
- Un-*** 50 billions multimodality dataset☆24Updated 2 years ago
- Official pytorch implementation of the IrwGAN for unaligned image-to-image translation☆34Updated 2 years ago
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)☆32Updated last year
- ISF-GAN, TMM 2022.☆17Updated 2 years ago
- [FGVC9-CVPR 2022] The second place solution for 2nd eBay eProduct Visual Search Challenge.☆26Updated 2 years ago
- This is an official implementation of GRIT-VLP☆20Updated 2 years ago
- ☆21Updated last year
- [ICCV 2021] Click to Move: Controlling Video Generation with Sparse Motion☆11Updated last year
- A collection of papers I am interested in.☆28Updated last year
- A non-JIT version implementation / replication of CLIP of OpenAI in pytorch☆34Updated 3 years ago
- Official repository for Polarity Sampling, CVPR 2022 ORAL☆12Updated 2 years ago
- ☆19Updated 3 years ago
- ☆37Updated 2 years ago
- Official pytorch implementation of I2I translation with low resolution conditioning☆23Updated 3 years ago
- ☆46Updated last year
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆72Updated last year
- Bag of MLP☆20Updated 3 years ago
- Evaluation benchmark for the task of Semantic Image Translation. Contains code to run FlexIT (CVPR 2022)☆32Updated 2 years ago
- LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)☆18Updated last year
- DreamDance: Personalized Text-to-video Generation by Combining Text-to-Image Synthesis and Motion Transfer☆14Updated last year
- CVPR 2021, Smoothing the Disentangled Latent Style Space for Unsupervised I2I Translation☆41Updated last year
- This is a collection of resources on AI-AR-ART generation.☆29Updated last year
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆30Updated 6 months ago
- ☆29Updated last year