WalterSimoncini / no-train-all-gain
Code for the paper "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations"
☆12Updated last year
Alternatives and similar repositories for no-train-all-gain
Users who are interested in no-train-all-gain are comparing it to the libraries listed below.
- ☆15Updated 11 months ago
- Library implementation of "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations"☆39Updated last year
- Collection of awesome Continual Test-Time Adaptation methods☆23Updated last year
- ☆10Updated last year
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024☆42Updated 11 months ago
- "Near, far: Patch-ordering enhances vision foundation models' scene understanding": A New SSL Post-Training Approach for Improving DINOv2…☆29Updated 6 months ago
- [CVPR2025] Official implementation of RAM☆24Updated 2 weeks ago
- [NeurIPS 2024] Activating Self-Attention for Multi-Scene Absolute Pose Regression☆13Updated 8 months ago
- Code for the paper "Compositional Entailment Learning for Hyperbolic Vision-Language Models".☆91Updated 5 months ago
- [ECCV 2024] Official implementation of the paper "Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning…☆29Updated 8 months ago
- ☆13Updated 7 months ago
- ☆27Updated last year
- Code for our paper: "Where's Waldo: Diffusion Features For Personalized Segmentation and Retrieval".☆15Updated 8 months ago
- Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models [CVPR 2025]☆75Updated 4 months ago
- Official repository of paper "Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval"☆10Updated last year
- [AAAI 2025] GFlow: Recovering 4D World from Monocular Video☆55Updated 6 months ago
- Segment This Thing is an efficient image segmentation model that uses a biologically-inspired foveated tokenization to reduce inference …☆53Updated 5 months ago
- ☆22Updated 11 months ago
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality☆19Updated last year
- This is the project for 'USG'.☆31Updated 7 months ago
- Official implementation of DIP: Unsupervised Dense In-Context Post-training of Visual Representations☆46Updated 2 months ago
- Official Implementation of ISR-DPO: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO (AAAI'25)☆23Updated 9 months ago
- [CVPR 2025] FLAIR: VLM with Fine-grained Language-informed Image Representations☆113Updated 2 months ago
- [AAAI 2025] Official pytorch implementation of "Diffusion Model Patching via Mixture-of-Prompts"☆12Updated 11 months ago
- Official PyTorch implementation of HCCNet: Efficient Semantic Matching with Hypercolumn Correlation (WACV '24 Oral, Best paper finalist (…☆11Updated last year
- (NeurIPS 2023) Open-set visual object query search & localization in long-form videos☆25Updated last year
- [NeurIPS '25 Spotlight] Official Pytorch implementation of "Vision Transformers Don't Need Trained Registers"☆142Updated last month
- [ICLR 2025] CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion☆54Updated 4 months ago
- [ICML'25] Kernel-based Unsupervised Embedding Alignment for Enhanced Visual Representation in Vision-language Models☆18Updated 2 months ago
- [CVPR2025] Synthetic Data is an Elegant GIFT for Continual Vision-Language Models☆20Updated 4 months ago