yibingwei-1 / LatentMIMView external linksLinks
[ECCV 2024] Official implementation of the paper "Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning"
☆29Mar 5, 2025Updated 11 months ago
Alternatives and similar repositories for LatentMIM
Users that are interested in LatentMIM are comparing it to the libraries listed below
Sorting:
- [ECCV 2024 Oral] Audio-Synchronized Visual Animation☆57Sep 12, 2024Updated last year
- [NeurIPS 2024] Activating Self-Attention for Multi-Scene Absolute Pose Regression☆14Feb 24, 2025Updated 11 months ago
- Official implementation of "Positional-encoding Image Prior" (PIP)☆16Mar 1, 2023Updated 2 years ago
- ☆21Aug 8, 2024Updated last year
- Code for the paper "AverNet: All-in-one Video Restoration for Time-varying Unknown Degradations" (NeurIPS 2024)☆33Oct 29, 2024Updated last year
- [CVPR 2025] Official PyTorch implementation of Not All Parameters Matter: Masking Diffusion Models for Enhancing Generation Ability☆32Jul 1, 2025Updated 7 months ago
- [ICML 2024] GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model☆68Feb 1, 2026Updated 2 weeks ago
- (BMVC 2022--Oral) Official repository for "Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations" …☆34Jan 8, 2023Updated 3 years ago
- (TMLR 2026) The official Pytorch Implementation of AnyIR for All in One Image Restoration☆29Sep 18, 2025Updated 4 months ago
- [CVPR-25🔥] Test-time Counterattacks (TTC) towards adversarial robustness of CLIP☆39Jun 4, 2025Updated 8 months ago
- Implementation of NIPS2023: Unleashing the Full Potential of Product Quantization for Large-Scale Image Retrieva☆11Nov 12, 2024Updated last year
- [ECCV'24 Oral] SPVLoc estimates 6D camera pose by matching images to semantic 3D models of indoor scenes, without scene-specific training…☆41Dec 8, 2025Updated 2 months ago
- The Official Implementation of CFCD. Coarse-to-Fine: Learning Compact Discriminative Representation for Single-Stage Image Retrieval☆31Nov 3, 2023Updated 2 years ago
- Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".☆39Aug 2, 2024Updated last year
- ☆17Feb 4, 2026Updated last week
- Source code for the paper "Memory-Efficient Fine-Tuning via Low-Rank Activation Compression"☆13Aug 1, 2025Updated 6 months ago
- [Electronics Letters 2024] YOLOv9 for Fracture Detection in Pediatric Wrist Trauma X-ray Images☆35Aug 12, 2025Updated 6 months ago
- [CVPR 2024] Leveraging Vision-Language Models for Improving Domain Generalization in Image Classification☆39Mar 6, 2024Updated last year
- [ECCV-2024] Transferable Targeted Adversarial Attack, CLIP models, Generative adversarial network, Multi-target attacks☆38Apr 23, 2025Updated 9 months ago
- (CVPR 2025) Scailing Down Text Encoders of Text-to-Image Diffusion Models☆51Sep 10, 2025Updated 5 months ago
- Linear Attention for Efficient Bidirectional Sequence Modeling☆15May 13, 2025Updated 9 months ago
- LiT (Zero-Shot Transfer with Locked-image text Tuning) image and text encoder models, working in the browser☆11May 16, 2022Updated 3 years ago
- ☆45Oct 5, 2025Updated 4 months ago
- Project to Accompany my YouTube Video on this topic☆11Sep 28, 2024Updated last year
- Fine-tuned YOLO v8 model for Soccer and Ice Hockey☆12Jun 27, 2024Updated last year
- [CoRL 2025] Search-TTA: A Multimodal Test-Time Adaptation Framework for Visual Search in the Wild☆25Jan 23, 2026Updated 3 weeks ago
- A2B Neural Rendering of Ambisonic Recordings to Binaural☆18Aug 5, 2025Updated 6 months ago
- [ACM MM2025]: Unleashing the Power of Data Generation in One-Pass Outdoor LiDAR Localization☆18Oct 29, 2025Updated 3 months ago
- Code for the AAAI 2024 paper: "AGS: Affordable and Generalizable Substitute Training for Transferable Adversarial Attack" (accepted).☆12Mar 28, 2024Updated last year
- ☆22Mar 28, 2025Updated 10 months ago
- Create your React Native App in days, not months • incl. Figma File☆11Mar 14, 2025Updated 11 months ago
- Debiasing Through Data Attribution☆12May 23, 2024Updated last year
- ☆14Updated this week
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆15Sep 7, 2024Updated last year
- LaSSM: Efficient Semantic-Spatial Query Decoding via Local Aggregation and State Space Models for 3D Instance Segmentation☆16May 27, 2025Updated 8 months ago
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024☆43Dec 7, 2024Updated last year
- MineInsight: A Multi-spectral Dataset for Humanitarian Demining Robotics in Off-Road Environments☆17Updated this week
- Adapters Strike Back (CVPR 2024)☆44Jul 24, 2024Updated last year
- [ECCV 2024] Official Pytorch Implementation of A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment☆92Jul 20, 2024Updated last year