yibingwei-1 / LatentMIM
[ECCV 2024] Official implementation of the paper "Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning"
☆26 · Updated 2 weeks ago
Alternatives and similar repositories for LatentMIM
Users who are interested in LatentMIM are comparing it to the repositories listed below:
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)" ☆33 · Updated 9 months ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference ☆76 · Updated 7 months ago
- (ICLR 2024, CVPR 2024) SparseFormer ☆73 · Updated 4 months ago
- Official code for "DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized Cut", NeurIPS 202… ☆37 · Updated 2 months ago
- [NIPS24] Official Implementation of Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation ☆18 · Updated 4 months ago
- Official implementation of the WACV 2024 paper CLIP-DIY ☆34 · Updated last year
- [CVPR 2025] FLAIR: VLM with Fine-grained Language-informed Image Representations ☆48 · Updated 2 weeks ago
- [ECCV 2024] DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control ☆73 · Updated 3 months ago
- [NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding ☆43 · Updated 2 months ago
- [NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives" ☆37 · Updated 3 months ago
- Official repository of paper "Subobject-level Image Tokenization" ☆65 · Updated 10 months ago
- [ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation ☆39 · Updated 5 months ago
- [CVPR24] Official Implementation of GEM (Grounding Everything Module) ☆113 · Updated 5 months ago
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation". ☆20 · Updated 5 months ago
- Official implementation of "Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive" (ICLR 2024) ☆52 · Updated 6 months ago
- Learning 1D Causal Visual Representation with De-focus Attention Networks ☆32 · Updated 9 months ago
- [CVPR 2023] Zero-shot Generative Model Adaptation via Image-specific Prompt Learning ☆84 · Updated last year
- DiverGen (CVPR 2024) & BSGAL (ICML 2024) ☆44 · Updated 4 months ago
- [NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews) ☆34 · Updated last month
- Personalized Representation from Personalized Generation (ICLR 2025) ☆54 · Updated 2 weeks ago
- (ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation ☆37 · Updated last year
- Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs. ☆96 · Updated last week
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality ☆31 · Updated 3 months ago