bethgelab / frequency_determines_performanceLinks
Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance" [NeurIPS'24]
☆90Updated last year
Alternatives and similar repositories for frequency_determines_performance
Users that are interested in frequency_determines_performance are comparing it to the libraries listed below
Sorting:
- ☆41Updated 11 months ago
- Patching open-vocabulary models by interpolating weights☆91Updated last year
- ☆22Updated 6 months ago
- Code, Data and Red Teaming for ZeroBench☆46Updated 2 months ago
- Holistic evaluation of multimodal foundation models☆48Updated 11 months ago
- ☆51Updated last year
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"☆53Updated last year
- ☆127Updated 10 months ago
- [ACL2025] Unsolvable Problem Detection: Robust Understanding Evaluation for Large Multimodal Models☆77Updated last month
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models".☆102Updated last year
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆27Updated 2 months ago
- Code for T-MARS data filtering☆35Updated last year
- Official implementation of the paper The Hidden Language of Diffusion Models☆74Updated last year
- ☆33Updated 6 months ago
- ☆33Updated last year
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent☆82Updated 3 weeks ago
- Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025.☆81Updated 2 months ago
- ☆38Updated 11 months ago
- Recycling diverse models☆45Updated 2 years ago
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)☆34Updated 2 years ago
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"☆36Updated last year
- Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"☆56Updated 10 months ago
- [CVPR 2025 🔥] ALM-Bench is a multilingual multi-modal diverse cultural benchmark for 100 languages across 19 categories. It assesses the…☆42Updated last month
- [Under Review] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with enla…☆60Updated 9 months ago
- Sparse and discrete interpretability tool for neural networks☆63Updated last year
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆40Updated 9 months ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆65Updated 10 months ago
- Official PyTorch Implementation of "Rosetta Neurons: Mining the Common Units in a Model Zoo"☆30Updated last year
- 🔥 [ICLR 2025] Official Benchmark Toolkits for "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"☆29Updated 5 months ago
- [ACL 2024 Findings & ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and inexpensive to use. Specific…☆73Updated 10 months ago