Simon-Lepage / CondViT-LRVSFLinks
Implementation of Conditional ViT on LAION — Referred Visual Search — Fashion
☆42Updated 10 months ago
Alternatives and similar repositories for CondViT-LRVSF
Users that are interested in CondViT-LRVSF are comparing it to the libraries listed below
Sorting:
- ☆64Updated 2 years ago
- Official implementation of "Is This Loss Informative? Faster Text-to-Image Customization by Tracking Objective Dynamics" (NeurIPS 2023)☆39Updated last year
- ☆73Updated 2 years ago
- Official Repo of ViscoNet: Bridging and Harmonizing Visual and Textual Conditioning for ControlNet☆30Updated 9 months ago
- TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder☆57Updated 5 months ago
- Official implementation for "pOps: Photo-Inspired Diffusion Operators"☆82Updated 11 months ago
- Diffusion Models as Data Mining Tools☆54Updated 2 months ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"☆64Updated last year
- Controlling diffusion-based image generation with just a few strokes☆63Updated last year
- Collection of scripts to build small-scale datasets for fine-tuning video generation models.☆63Updated 4 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆48Updated 5 months ago
- Official Implementation of weights2weights☆144Updated 4 months ago
- ☆21Updated 2 years ago
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance".☆42Updated 2 years ago
- Implementation of MDP: A Generalized Framework for Text-Guided Image Editing by Manipulating the Diffusion Path☆69Updated 2 years ago
- A Versatile Face Encoder for Zero-Shot Diffusion Model Personalization☆24Updated this week
- This repository provides utilities to a minimal dataset for InstructPix2Pix like training for Diffusion models.☆47Updated 2 years ago
- ☆24Updated 2 years ago
- This is the official repository for the paper "OpenFashionCLIP: Vision-and-Language Contrastive Learning with Open-Source Fashion Data". …☆68Updated last year
- ☆22Updated 2 months ago
- JAX implementation ViT-VQGAN☆83Updated 2 years ago
- Fast Sprite Decomposition from Animated Graphics [ECCV2024]☆32Updated 9 months ago
- ☆27Updated last year
- ☆4Updated 9 months ago
- Easily compute clip embeddings from video frames☆145Updated last year
- ☆31Updated last year
- ☆46Updated 11 months ago
- Fine-tune of Florence-2 for shot categorization.☆26Updated 4 months ago
- Directed Diffusion: Direct Control of Object Placement through Attention Guidance (AAAI2024)☆79Updated last year
- Distilling Diversity and Control in Diffusion Models☆44Updated 2 months ago