LexCybermac / smlr

A Simple Image Clustering Script using CLIP and Hierarchical Clustering

☆34

Alternatives and similar repositories for smlr:

Users who are interested in smlr compare it to the repositories listed below.

ariG23498 / TokenLearner
TensorFlow implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?"
☆33 Updated 3 years ago
awilliamson10 / clipora
Clipora is a powerful toolkit for fine-tuning OpenCLIP models using Low Rank Adapters (LoRA).
☆19 Updated 6 months ago
Zasder3 / train-CLIP-FT
☆46 Updated 3 years ago
damian0815 / finetune-clip-huggingface
Finetuning CLIP on a small image/text dataset using huggingface libs
☆44 Updated 2 years ago
sergiuoprea / clip_with_few_shots
Few shot recognition using CLIP's OpenAI architecture.
☆36 Updated 3 years ago
jeykigung / HiCLIP
☆29 Updated last year
GewelsJI / MVLT
Masked Vision-Language Transformer in Fashion
☆33 Updated last year
WalBouss / MaskInversion
☆23 Updated 4 months ago
elsevierlabs-os / clip-image-search
Fine-tuning OpenAI CLIP Model for Image Search on medical images
☆76 Updated 2 years ago
InfiMM / mllm-hd
Official code for infimm-hd
☆15 Updated 5 months ago
data2ml / all-clip
Load any clip model with a standardized interface
☆21 Updated 9 months ago
Expedit-LargeScale-Vision-Transformer / Expedit-SAM
[NeurIPS2022] This is the official implementation of the paper "Expediting Large-Scale Vision Transformer for Dense Prediction without Fi…
☆83 Updated last year
taesiri / ZoomIsAllYouNeed
Official code and data for NeurIPS 2023 paper "ImageNet-Hard: The Hardest Images Remaining from a Study of the Power of Zoom and Spatial …
☆37 Updated last year
KaliYuga-ai / blip-lora-dreambooth-finetuning
☆30 Updated 2 years ago
hirl-team / HIRL
HIRL: A General Framework for Hierarchical Image Representation Learning (http://arxiv.org/abs/2205.13159)
☆40 Updated 2 years ago
jialuli-luka / SELMA
Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
☆33 Updated 11 months ago
sehyunkwon / ICTC
This is a public repository for Image Clustering Conditioned on Text Criteria (IC|TC)
☆83 Updated 11 months ago
johnowhitaker / imstack
Optimizable stack of images at different resolutions, a useful representation of images for deep learning tasks. Docs: https://johnowhita…
☆11 Updated 2 years ago
salesforce / MUST
PyTorch code for MUST
☆106 Updated last year
shonenkov / CLIP-ODS
CLIP Object Detection, search object on image using natural language #Zeroshot #Unsupervised #CLIP #ODS
☆139 Updated 3 years ago
facebookresearch / genecis
Code and Models for "GeneCIS A Benchmark for General Conditional Image Similarity"
☆56 Updated last year
usc-sail / mica-MovieCLIP
This repository contains the codebase for MovieCLIP: Visual Scene Recognition in Movies
☆35 Updated last year
jeongukjae / CLIP-self-attention-visualization
Plotting heatmaps with the self-attention of the [CLS] tokens in the last layer.
☆42 Updated 2 years ago
eric-ai-lab / ComCLIP
Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"
☆35 Updated 6 months ago
TencentARC / ViSFT
☆34 Updated last year
RotsteinNoam / FuseCap
FuseCap: Large Language Model for Visual Data Fusion in Enriched Caption Generation
☆53 Updated 10 months ago
BrandonHanx / FAME-ViL
[CVPR 2023 (Highlight)] FAME-ViL: Multi-Tasking V+L Model for Heterogeneous Fashion Tasks
☆52 Updated last year
FrancescoSaverioZuppichini / DropPath
Implementing DropPath/StochasticDepth in PyTorch
☆16 Updated 3 years ago