ExplainableML / fomo_in_fluxLinks
Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]
☆57Updated 8 months ago
Alternatives and similar repositories for fomo_in_flux
Users that are interested in fomo_in_flux are comparing it to the libraries listed below
Sorting:
- Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.☆33Updated 2 years ago
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆21Updated last year
- [ACL2025] Unsolvable Problem Detection: Robust Understanding Evaluation for Large Multimodal Models☆77Updated 3 months ago
- [NeurIPS 2023] Official repository for "Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models"☆12Updated last year
- [NeurIPS'24] Official PyTorch implementation for paper "Knowledge Composition using Task Vectors with Learned Anisotropic Scaling"☆24Updated 6 months ago
- [ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.☆27Updated last year
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions☆17Updated last year
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆42Updated last year
- [NeurIPS'24] Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization☆33Updated 10 months ago
- ☆34Updated last year
- Distributed Optimization Infra for learning CLIP models☆27Updated 10 months ago
- ☆52Updated 7 months ago
- Test-Time Distribution Normalization For Contrastively Learned Vision-language Models☆27Updated last year
- [WACV 2025] Official implementation of "Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation" by Xiwen Wei, Guihong L…☆47Updated this week
- Official code for the paper "Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?" (ICLR 2024)☆10Updated last year
- Data-Efficient Multimodal Fusion on a Single GPU☆66Updated last year
- https://arxiv.org/abs/2209.15162☆52Updated 2 years ago
- ☆38Updated last year
- Code release for "Understanding Bias in Large-Scale Visual Datasets"☆21Updated 8 months ago
- Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models☆32Updated 4 months ago
- Official Code for ICLR 2024 Paper: Non-negative Contrastive Learning☆45Updated last year
- An official PyTorch implementation for CLIPPR☆29Updated 2 years ago
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆36Updated 2 years ago
- Code for T-MARS data filtering☆35Updated 2 years ago
- Model Merging with SVD to Tie the KnOTS [ICLR 2025]☆63Updated 4 months ago
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)☆34Updated 2 years ago
- Patching open-vocabulary models by interpolating weights☆91Updated last year
- Do Vision and Language Models Share Concepts? A Vector Space Alignment Study☆16Updated 9 months ago
- Code for ICLR'24 workshop ME-FoMo-How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary Investigation☆37Updated 10 months ago
- Compress conventional Vision-Language Pre-training data☆52Updated last year