Simon-Lepage / CondViT-LRVSF
Implementation of Conditional ViT on LAION — Referred Visual Search — Fashion
☆42 · Updated last year
Alternatives and similar repositories for CondViT-LRVSF
Users who are interested in CondViT-LRVSF are comparing it to the repositories listed below.
- ☆73 · Updated 2 years ago
- Official implementation of "Is This Loss Informative? Faster Text-to-Image Customization by Tracking Objective Dynamics" (NeurIPS 2023) ☆39 · Updated 2 years ago
- This is the official repository for the paper "OpenFashionCLIP: Vision-and-Language Contrastive Learning with Open-Source Fashion Data". … ☆71 · Updated last year
- ☆64 · Updated 2 years ago
- Official implementation for "pOps: Photo-Inspired Diffusion Operators" ☆84 · Updated last year
- Official Implementation of weights2weights ☆154 · Updated 10 months ago
- Controlling diffusion-based image generation with just a few strokes ☆64 · Updated 2 years ago
- Easily compute clip embeddings from video frames ☆147 · Updated 2 years ago
- ☆46 · Updated 5 months ago
- Fine-tune of Florence-2 for shot categorization. ☆26 · Updated 10 months ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation" ☆65 · Updated last year
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models. ☆50 · Updated 11 months ago
- Iterable datapipelines for pytorch training. ☆88 · Updated last year
- A Versatile Face Encoder for Zero-Shot Diffusion Model Personalization ☆24 · Updated 6 months ago
- ☆24 · Updated last year
- TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder ☆57 · Updated 11 months ago
- Diffusion Models as Data Mining Tools ☆56 · Updated 8 months ago
- ☆86 · Updated 2 years ago
- Official Repo of ViscoNet: Bridging and Harmonizing Visual and Textual Conditioning for ControlNet ☆30 · Updated last year
- [ECCV2024] Fast Sprite Decomposition from Animated Graphics ☆31 · Updated last year
- ☆32 · Updated last year
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance". ☆42 · Updated 2 years ago
- Implementation of the text to video model LUMIERE from the paper: "A Space-Time Diffusion Model for Video Generation" by Google Research ☆52 · Updated 11 months ago
- [NeurIPS 2022: Score-Based Modeling Workshop] Multiresolution Textual Inversion ☆99 · Updated 2 years ago
- Directed Diffusion: Direct Control of Object Placement through Attention Guidance (AAAI2024) ☆81 · Updated last year
- Learning Motion from Low-Rank Adaptation ☆46 · Updated last year
- JAX implementation ViT-VQGAN ☆82 · Updated 3 years ago
- ☆24 · Updated 4 months ago
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data ☆35 · Updated last year
- ☆82 · Updated 2 years ago