[ECCV 2024] Official implementation of the paper "Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning"
☆30 · Mar 5, 2025 · Updated last year
Alternatives and similar repositories for LatentMIM
Users that are interested in LatentMIM are comparing it to the libraries listed below.
- [NeurIPS 2024] Activating Self-Attention for Multi-Scene Absolute Pose Regression ☆14 · Feb 24, 2025 · Updated last year
- Official implementation of "Positional-encoding Image Prior" (PIP) ☆17 · Mar 1, 2023 · Updated 3 years ago
- The Spacetime of Diffusion Models: An Information Geometry Perspective (ICLR 2026 Oral) ☆41 · Feb 21, 2026 · Updated 2 months ago
- Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023) ☆12 · Jun 1, 2023 · Updated 2 years ago
- Code for the paper "AverNet: All-in-one Video Restoration for Time-varying Unknown Degradations" (NeurIPS 2024) ☆36 · Oct 29, 2024 · Updated last year
- [CoRL 2025] Search-TTA: A Multimodal Test-Time Adaptation Framework for Visual Search in the Wild ☆28 · Jan 23, 2026 · Updated 3 months ago
- [CVPR 2025] Official PyTorch implementation of Not All Parameters Matter: Masking Diffusion Models for Enhancing Generation Ability ☆62 · May 13, 2026 · Updated last week
- ☆22 · Jul 3, 2025 · Updated 10 months ago
- [ECCV'24 Oral] SPVLoc estimates 6D camera pose by matching images to semantic 3D models of indoor scenes, without scene-specific training… ☆43 · Mar 4, 2026 · Updated 2 months ago
- Official Codebase of "A Closer Look at Weakly-Supervised Audio-Visual Source Localization" (NeurIPS 2022) ☆21 · Dec 6, 2022 · Updated 3 years ago
- ☆21 · Aug 8, 2024 · Updated last year
- [ICML 2024] GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model ☆72 · Feb 1, 2026 · Updated 3 months ago
- [JAG'26] SpatialLLM: From Multi-modality Data to Urban Spatial Intelligence ☆116 · Mar 5, 2026 · Updated 2 months ago
- codes for RFSR: Improving ISR Diffusion Models via Reward Feedback Learning ☆18 · Dec 8, 2024 · Updated last year
- [ECCV-2024] Transferable Targeted Adversarial Attack, CLIP models, Generative adversarial network, Multi-target attacks ☆40 · Apr 23, 2025 · Updated last year
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024 ☆44 · Dec 7, 2024 · Updated last year
- [NeurIPS 2025] Better Tokens for Better 3D: Advancing Vision-Language Modeling in 3D Medical Imaging ☆40 · Nov 4, 2025 · Updated 6 months ago
- ☆30 · Jul 23, 2025 · Updated 9 months ago
- HuiYanEarth-SAR(2026) ☆32 · May 11, 2026 · Updated last week
- Official PyTorch Repository of "Minority-Oriented Vicinity Expansion with Attentive Aggregation for Video Long-Tailed Recognition" (AAAI … ☆13 · Jul 27, 2023 · Updated 2 years ago
- Adapters Strike Back (CVPR 2024) ☆44 · Jul 24, 2024 · Updated last year
- The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024 ☆18 · Oct 11, 2024 · Updated last year
- Checkpoints, logs and source code for AAAI-23 paper 'Data-Efficient Image Quality Assessment with Attention-Panel Decoder' ☆39 · Apr 3, 2024 · Updated 2 years ago
- FunnyBirds: A Synthetic Vision Dataset for a Part-Based Analysis of Explainable AI Methods (ICCV 2023) ☆17 · Apr 8, 2024 · Updated 2 years ago
- Harmonization of multi-site, multi-shell diffusion MRI ☆14 · Oct 16, 2025 · Updated 7 months ago
- A compilation of network architectures for vision and others without usage of self-attention mechanism ☆81 · Jan 18, 2023 · Updated 3 years ago
- Repository for "Enhanced Super-Resolution Training via Mimicked Alignment for Real-World Scenes", ACCV 2024 ☆16 · Dec 2, 2024 · Updated last year
- Unofficial version of LaneExtraction ☆13 · Oct 12, 2022 · Updated 3 years ago
- Course: DD2412 Deep Learning Advanced at KTH Project by Casper, Magnus, and Friso Focus: Self-supervised learning and computer vision wit… ☆12 · Dec 15, 2023 · Updated 2 years ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer ☆15 · Sep 7, 2024 · Updated last year
- [NeurIPS'24] Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization ☆40 · Sep 30, 2024 · Updated last year
- Official repo for Contrastive Diffusion Loss ☆14 · Dec 12, 2024 · Updated last year
- [ECCV 2024] Official Pytorch Implementation of A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment ☆93 · Jul 20, 2024 · Updated last year
- M-SpecGene: Generalized Foundation Model for RGBT Multispectral Vision (ICCV 2025) ☆36 · Nov 19, 2025 · Updated 6 months ago
- ☆14 · Dec 22, 2025 · Updated 4 months ago
- Official repo for [CVPR 2026] "SARMAE: Masked Autoencoder for SAR Representation Learning" ☆40 · Apr 15, 2026 · Updated last month
- Use Stable Diffusion intrinsic lora to render texture maps (normal, albedo, shade, depth) ☆16 · Mar 10, 2024 · Updated 2 years ago
- ☆46 · Oct 5, 2025 · Updated 7 months ago
- Material from the talk "The bad guys in AI - atacando sistemas de machine learning" ☆16 · Nov 22, 2022 · Updated 3 years ago