LexTypeC / smlrLinks
A Simple Image Clustering Script using CLIP and Hierarchial Clustering
☆38Updated 2 years ago
Alternatives and similar repositories for smlr
Users that are interested in smlr are comparing it to the libraries listed below
Sorting:
- Clipora is a powerful toolkit for fine-tuning OpenCLIP models using Low Rank Adapters (LoRA).☆24Updated last year
- ☆36Updated last year
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆102Updated last year
- ☆48Updated 4 years ago
- HIRL: A General Framework for Hierarchical Image Representation Learning (http://arxiv.org/abs/2205.13159)☆40Updated 3 years ago
- An official PyTorch implementation for CLIPPR☆30Updated 2 years ago
- TensorFlow implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?"☆36Updated 4 years ago
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆82Updated 2 years ago
- Official code and data for NeurIPS 2023 paper "ImageNet-Hard: The Hardest Images Remaining from a Study of the Power of Zoom and Spatial …☆40Updated 2 years ago
- Official Pytorch Implementation of Self-emerging Token Labeling☆35Updated last year
- Official implementation of "Active Image Indexing"☆60Updated 2 years ago
- [ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT☆56Updated last year
- Masking Strategies for Background Bias Removal in Computer Vision Models (ICCVW OODCV 2023 paper)☆16Updated 6 months ago
- Easily compute model embeddings and save the embeddings.☆10Updated 3 years ago
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆18Updated 3 years ago
- Load any clip model with a standardized interface☆22Updated 2 months ago
- [ICME 2022] code for the paper, SimVit: Exploring a simple vision transformer with sliding windows.☆68Updated 3 years ago
- Exploring Hierarchical Graph Representation for Large-Scale Zero-Shot Image Classification. ECCV 2022.☆18Updated 3 years ago
- ☆25Updated 2 years ago
- [NeurIPS2022] This is the official implementation of the paper "Expediting Large-Scale Vision Transformer for Dense Prediction without Fi…☆85Updated 2 years ago
- Using pretrained encoder and language models to generate captions from multimedia inputs.☆98Updated 2 years ago
- PyTorch code for MUST☆108Updated 8 months ago
- ViT trained on COYO-Labeled-300M dataset☆33Updated 3 years ago
- Official implementation of Data-Free Sketch-Based Image Retrieval, CVPR 2023.☆27Updated 2 years ago
- Un-*** 50 billions multimodality dataset☆23Updated 3 years ago
- DoodleFormer: Creative Sketch Drawing with Transformers (ECCV22)☆31Updated 3 years ago
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models" ICLR 2024☆110Updated last year
- Official repository of the paper "GPR1200: A Benchmark for General-PurposeContent-Based Image Retrieval"☆29Updated 9 months ago
- Video descriptions of research papers relating to foundation models and scaling☆30Updated 2 years ago
- Evaluation benchmark for the task of Semantic Image Translation. Contains code to run FlexIT (CVPR 2022)☆34Updated 3 years ago