yibingwei-1/LatentMIM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yibingwei-1/LatentMIM)

yibingwei-1 / LatentMIM

[ECCV 2024] Official implementation of the paper "Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning"

☆30

Alternatives and similar repositories for LatentMIM

Users that are interested in LatentMIM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

stoneMo / DeepAVFusion
View on GitHub
Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".
☆43Aug 2, 2024Updated last year
biomedia-mira / flow-ssn
View on GitHub
[ICCV 2025] Code for "Flow Stochastic Segmentation Networks"
☆17Jun 9, 2026Updated last month
NimrodShabtay / positional-encoding-image-prior
View on GitHub
Official implementation of "Positional-encoding Image Prior" (PIP)
☆18Mar 1, 2023Updated 3 years ago
mu-cai / FRL
View on GitHub
Code for WACV 2023 paper "Out-of-distribution Detection via Frequency-regularized Generative Models" by Mu Cai and Yixuan Li
☆11May 1, 2023Updated 3 years ago
rafalkarczewski / spacetime-geometry
View on GitHub
The Spacetime of Diffusion Models: An Information Geometry Perspective (ICLR 2026 Oral)
☆48Feb 21, 2026Updated 5 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
zhenghuizhao / ChangeBridge
View on GitHub
[CVPR2026] ChangeBridge: Spatiotemporal Image Generation with Multimodal Controls for Remote Sensing
☆17Jun 7, 2026Updated last month
diaoquesang / WDT-MD
View on GitHub
[AAAI 2026] WDT-MD: Wavelet Diffusion Transformers for Microaneurysm Detection in Fundus Images
☆16Mar 25, 2026Updated 3 months ago
bowang-lab / AMOS-MM-Solution
View on GitHub
Solution to the AMOS-MM challenge
☆16Sep 13, 2025Updated 10 months ago
lingli1996 / GeoReasoner
View on GitHub
[ICML 2024] GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model
☆74Feb 1, 2026Updated 5 months ago
qimaqi / CityLoc
View on GitHub
Offical implementation of work 6 DoF Localization of Text Descriptions in Large-Scale Scenes with Gaussian Representation
☆19Feb 5, 2025Updated last year
fpv-iplab / stillfast
View on GitHub
Code for the paper: F. Ragusa, G. M. Farinella, A. Furnari. StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipat…
☆13Apr 11, 2023Updated 3 years ago
ffhibnese / CGNC_Targeted_Adversarial_Attacks
View on GitHub
[ECCV-2024] Transferable Targeted Adversarial Attack, CLIP models, Generative adversarial network, Multi-target attacks
☆39Apr 23, 2025Updated last year
GeWu-Lab / Stepping-Stones
View on GitHub
The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024
☆18Oct 11, 2024Updated last year
jhuldr / Brain-ID
View on GitHub
[ECCV 2024] Brain-ID: Learning Contrast-agnostic Anatomical Representations for Brain Imaging
☆38Jan 31, 2026Updated 5 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
visinf / funnybirds-framework
View on GitHub
FunnyBirds: A Synthetic Vision Dataset for a Part-Based Analysis of Explainable AI Methods (ICCV 2023)
☆17Apr 8, 2024Updated 2 years ago
narthchin / DEIQT
View on GitHub
Checkpoints, logs and source code for AAAI-23 paper 'Data-Efficient Image Quality Assessment with Attention-Panel Decoder'
☆39Apr 3, 2024Updated 2 years ago
lezhang7 / RiT
View on GitHub
PyTorch implementation of RiT: Vanilla Diffusion Transformers Suffice in Representation Space
☆26May 23, 2026Updated last month
fraunhoferhhi / spvloc
View on GitHub
[ECCV'24 Oral] SPVLoc estimates 6D camera pose by matching images to semantic 3D models of indoor scenes, without scene-specific training…
☆45Mar 4, 2026Updated 4 months ago
FengheTan9 / Hi-End-MAE
View on GitHub
[MedIA 2026] Hi-End-MAE: Hierarchical encoder-driven masked autoencoders are stronger vision learners for medical image segmentation
☆33Feb 16, 2026Updated 5 months ago
TonyXuQAQ / UnofficialLaneExtraction
View on GitHub
Unofficial version of LaneExtraction
☆13Oct 12, 2022Updated 3 years ago
HimangiM / RepLAI
View on GitHub
Self-supervised algorithm for learning representations from ego-centric video data. Code is tested on EPIC-Kitchens-100 and Ego4D in PyTo…
☆13Oct 23, 2022Updated 3 years ago
prateksha / ScaleSpaceDiffusion
View on GitHub
[CVPR 2026] Scale Space Diffusion
☆30Jul 9, 2026Updated last week
TianheWu / MLLMs-for-IQA
View on GitHub
[ECCV 2024] Official Pytorch Implementation of A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment
☆94Jul 20, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
brown-palm / goal-force
View on GitHub
Official implementation of "Goal Force: Teaching Video Models To Accomplish Physics-Conditioned Goals" (CVPR 2026)
☆41Feb 25, 2026Updated 4 months ago
yunshuwu / ContrastiveDiffusionLoss
View on GitHub
Official repo for Contrastive Diffusion Loss
☆14Dec 12, 2024Updated last year
SuhoPark0706 / TBSNet
View on GitHub
Offical Code for TBSNet(AAAI 2024)
☆14Feb 17, 2024Updated 2 years ago
XLearning-SCU / 2024-NeurIPS-AverNet
View on GitHub
Code for the paper "AverNet: All-in-one Video Restoration for Time-varying Unknown Degradations" (NeurIPS 2024)
☆36Oct 29, 2024Updated last year
anti-fake / DeepfakeDetector
View on GitHub
☆16Dec 29, 2025Updated 6 months ago
anti-fake / DeepfakeGenerator
View on GitHub
☆16Dec 29, 2025Updated 6 months ago
visinf / self-adaptive
View on GitHub
Semantic Self-adaptation: Enhancing Generalization with a Single Sample (TMLR 2023)
☆18Jul 21, 2023Updated 3 years ago
iecashhy / RS-vHeat
View on GitHub
☆15Oct 14, 2025Updated 9 months ago
jiwoogit / DCP-GAN
View on GitHub
[CVPR 2024] Diversity-aware Channel Pruning for StyleGAN Compression
☆26Jul 23, 2025Updated 11 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
prs-eth / DGInStyle-SegModel
View on GitHub
Downstream semantic segmentation evaluation of DGInStyle.
☆25Apr 1, 2024Updated 2 years ago
mzeeshankaramat / SafeAgents
View on GitHub
☆20Jun 4, 2026Updated last month
LiYingwei / Regional-Homogeneity
View on GitHub
Source code for Regional Homogeneity: Towards Learning Transferable Universal Adversarial Perturbations Against Defenses (ECCV 2020)
☆42Apr 2, 2019Updated 7 years ago
PRITHIVSAKTHIUR / Imagineo-4K
View on GitHub
Midjourney X Instant Collage -- Collage Template + Grid + Quality Style
☆12May 25, 2025Updated last year
visinf / primaps
View on GitHub
Boosting Unsupervised Semantic Segmentation with Principal Mask Proposals (TMLR 2024)
☆19Nov 27, 2024Updated last year
kaist-cvml / PixelREPA
View on GitHub
[ECCV 2026] Official code of "Representation Alignment for Just Image Transformers is not Easier than You Think"
☆47Jun 18, 2026Updated last month
mila-iqia / Casande-RL
View on GitHub
Casande-RL
☆11May 9, 2023Updated 3 years ago