Simon-Lepage / CondViT-LRVSF
Implementation of Conditional ViT on LAION — Referred Visual Search — Fashion
☆38Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for CondViT-LRVSF
- Official implementation of "Is This Loss Informative? Faster Text-to-Image Customization by Tracking Objective Dynamics" (NeurIPS 2023)☆37Updated last year
- Diffusion base mining☆48Updated last month
- Code for "Don’t drop your samples! Coherence-aware training benefits Conditional diffusion" CVPR 2024 Highlight☆44Updated 5 months ago
- [ECCV 2024] Code for "EraseDraw: Learning to Insert Objects by Erasing Them from Images"☆17Updated 3 months ago
- High order Moment Models☆22Updated last week
- ☆71Updated last year
- ☆65Updated last year
- Navigate dreamscapes with a click – your chosen point guides the drone’s flight in a thrilling visual journey.☆42Updated 10 months ago
- Fast Sprite Decomposition from Animated Graphics [ECCV2024]☆27Updated last month
- Official PyTorch implementation of the paper "Neural Congealing: Aligning Images to a Joint Semantic Atlas" (CVPR 2023)☆46Updated last year
- A Versatile Face Encoder for Zero-Shot Diffusion Model Personalization☆19Updated this week
- Official implementation for "pOps: Photo-Inspired Diffusion Operators"☆70Updated 3 months ago
- ☆44Updated 3 months ago
- Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion☆35Updated 3 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆43Updated 2 weeks ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"☆60Updated 6 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆30Updated 4 months ago
- TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder☆48Updated this week
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆32Updated 8 months ago
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance".☆41Updated last year
- TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation☆26Updated 2 weeks ago
- Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024)☆81Updated 3 months ago
- Implementation of MDP: A Generalized Framework for Text-Guided Image Editing by Manipulating the Diffusion Path☆66Updated last year
- ☆20Updated last month
- Official Implementation of Puppet-Master: Scaling Interactive Video Generation as a Motion Prior for Part-Level Dynamics☆80Updated last month
- This repository provides utilities to a minimal dataset for InstructPix2Pix like training for Diffusion models.☆43Updated last year
- A curated list of papers and resources for text-to-image evaluation.☆26Updated last year
- ☆40Updated last year
- Code for "DreamEdit: Subject-driven Image Editing" (TMLR2023)☆105Updated 9 months ago