yibingwei-1 / LatentMIM
[ECCV 2024] Official implementation of the paper "Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning"
☆27Updated last month
Alternatives and similar repositories for LatentMIM:
Users that are interested in LatentMIM are comparing it to the libraries listed below
- ☆27Updated 3 months ago
- Official repository of paper "Subobject-level Image Tokenization"☆69Updated 2 weeks ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆80Updated 3 weeks ago
- [CVPR 2025] Test-Time Visual In-Context Tuning☆22Updated 3 weeks ago
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs☆26Updated 3 months ago
- [ECCV 2024] DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control☆76Updated 4 months ago
- Code and data for the paper "Emergent Visual-Semantic Hierarchies in Image-Text Representations" (ECCV 2024)☆27Updated 8 months ago
- [NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives"☆37Updated 4 months ago
- ☆41Updated 6 months ago
- ☆59Updated last year
- Rui Qian, Xin Yin, Dejing Dou†: Reasoning to Attend: Try to Understand How <SEG> Token Works (CVPR 2025)☆27Updated 2 weeks ago
- (ICLR 2024, CVPR 2024) SparseFormer☆73Updated 5 months ago
- Code for "How far can we go with ImageNet for Text-to-Image generation?" paper☆83Updated 3 weeks ago
- ☆29Updated last year
- [NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding☆43Updated 3 months ago
- Official implementation of "Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive" (ICLR 2024)☆54Updated 7 months ago
- [CVPR 2024 Highlight] PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis☆40Updated last year
- (ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation☆37Updated last year
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".☆21Updated 6 months ago
- Adapters Strike Back (CVPR 2024)☆35Updated 8 months ago
- [ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation☆40Updated 6 months ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆37Updated 10 months ago
- [CVPR 2023] Zero-shot Generative Model Adaptation via Image-specific Prompt Learning☆84Updated last year
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data☆54Updated last year
- [CVPR 2024 Highlight] ImageNet-D☆42Updated 6 months ago
- Official implementation of the WACV 2024 paper CLIP-DIY☆34Updated last year
- [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models☆23Updated 5 months ago
- ☆11Updated 9 months ago
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"☆43Updated 10 months ago
- ☆32Updated this week