LexCybermac / smlr
A Simple Image Clustering Script using CLIP and Hierarchial Clustering
☆27Updated last year
Related projects: ⓘ
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆72Updated last year
- Commonality in Natural Images Rescues GANs: Pretraining GANs with Generic and Privacy-free Synthetic Data - Official PyTorch Implementati…☆35Updated last year
- Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024)☆73Updated last month
- Evaluation benchmark for the task of Semantic Image Translation. Contains code to run FlexIT (CVPR 2022)☆32Updated 2 years ago
- ☆21Updated last year
- TensorFlow implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?"☆33Updated 2 years ago
- Masked Vision-Language Transformer in Fashion☆32Updated 11 months ago
- ☆58Updated 10 months ago
- ☆50Updated 2 years ago
- This is the official repository for the paper "OpenFashionCLIP: Vision-and-Language Contrastive Learning with Open-Source Fashion Data". …☆52Updated 4 months ago
- Code and Models for "GeneCIS A Benchmark for General Conditional Image Similarity"☆54Updated last year
- ☆32Updated 8 months ago
- Release of ImageNet-Captions☆45Updated last year
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆30Updated 6 months ago
- Official implementation of the paper The Hidden Language of Diffusion Models☆66Updated 7 months ago
- A Versatile Face Encoder for Zero-Shot Diffusion Model Personalization☆18Updated this week
- A curated list of papers and resources for text-to-image evaluation.☆26Updated last year
- ☆43Updated 3 years ago
- This repository contains the codebase for MovieCLIP: Visual Scene Recognition in Movies☆30Updated 11 months ago
- [ACM MM 2022] Towards Counterfactual Image Manipulation via CLIP☆37Updated 2 years ago
- ☆17Updated last year
- [ICLR'23] Code to reproduce the results in the paper "PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANs"☆58Updated last year
- ☆29Updated last year
- [FGVC9-CVPR 2022] The second place solution for 2nd eBay eProduct Visual Search Challenge.☆26Updated 2 years ago
- Official code repo for "Editing Implicit Assumptions in Text-to-Image Diffusion Models"☆81Updated last year
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"☆61Updated 4 months ago
- A Challenging Benchmark of Anime Style Recognition☆23Updated last week
- CLIP-Art: Contrastive Pre-training for Fine-Grained Art Classification - 4th Workshop on Computer Vision for Fashion, Art, and Design☆27Updated 2 years ago
- The implementation for Accelerating Guided Diffusion Sampling with Splitting Numerical Methods (2023)☆48Updated last year