bethgelab / frequency_determines_performance
Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance" [NeurIPS'24]
☆75 · Updated 6 months ago
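The paper's headline finding is a log-linear scaling trend: a model's zero-shot performance on a concept improves roughly linearly as that concept's pretraining frequency grows exponentially. Below is a minimal sketch of that relation with made-up frequency/accuracy numbers, purely for illustration; nothing here comes from the repo's code.

```python
# Minimal sketch (hypothetical numbers, not from the repo): the paper's
# log-linear trend says zero-shot accuracy ~ a * log(concept frequency) + b.
import numpy as np

freq = np.array([1e2, 1e3, 1e4, 1e5, 1e6])      # pretraining occurrences of a concept
acc = np.array([0.12, 0.25, 0.38, 0.52, 0.64])  # zero-shot accuracy on that concept

# Fit accuracy against log10(frequency).
slope, intercept = np.polyfit(np.log10(freq), acc, deg=1)
print(f"accuracy ≈ {slope:.3f} * log10(freq) + {intercept:.3f}")

# Under this trend, a fixed accuracy gain requires multiplying a concept's
# pretraining frequency by a constant factor -- hence "exponential data".
```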
Related projects
Alternatives and complementary repositories for frequency_determines_performance
- Code for T-MARS data filtering ☆35 · Updated last year
- Sparse Linear Concept Embeddings ☆69 · Updated 3 months ago
- ☆38 · Updated 3 months ago
- Patching open-vocabulary models by interpolating weights ☆90 · Updated last year
- Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning" ☆34 · Updated 8 months ago
- Un-*** 50-billion multimodality dataset ☆24 · Updated 2 years ago
- Official implementation of the paper "The Hidden Language of Diffusion Models" ☆69 · Updated 9 months ago
- Language Quantized AutoEncoders ☆94 · Updated last year
- Code for the experiments in "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy" ☆96 · Updated 2 months ago
- Evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive". https://arxiv.or… ☆107 · Updated 4 months ago
- Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models ☆70 · Updated 2 months ago
- [ECCV 2024] Official Release of SILC: Improving vision language pretraining with self-distillation ☆36 · Updated last month
- ☆43 · Updated last year
- Recursive Visual Programming ☆16 · Updated this week
- ☆30 · Updated 9 months ago
- [Under Review] Official PyTorch implementation of the technical part of Phantom of Latent, equipped with enla… ☆45 · Updated last month
- Code for the paper "CiT: Curation in Training for Effective Vision-Language Data" ☆78 · Updated last year
- Holistic evaluation of multimodal foundation models ☆41 · Updated 3 months ago
- Matryoshka Multimodal Models ☆82 · Updated this week
- What do we learn from inverting CLIP models? ☆45 · Updated 8 months ago
- ☆20 · Updated last month
- ☆33 · Updated 4 months ago
- Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision ☆47 · Updated 4 months ago
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent ☆62 · Updated 3 months ago
- ☆30 · Updated this week
- ☆64 · Updated 4 months ago
- Official PyTorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations" ☆34 · Updated this week
- ☆29 · Updated 2 years ago
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024) ☆25 · Updated 4 months ago