bethgelab / frequency_determines_performance
Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance" [NeurIPS'24]
☆89Updated 11 months ago
Alternatives and similar repositories for frequency_determines_performance:
Users that are interested in frequency_determines_performance are comparing it to the libraries listed below
- ☆41Updated 9 months ago
- Official implementation of the paper The Hidden Language of Diffusion Models☆72Updated last year
- ☆32Updated last year
- Code, Data and Red Teaming for ZeroBench☆45Updated 2 months ago
- Code release for "Improved baselines for vision-language pre-training"☆60Updated 11 months ago
- Patching open-vocabulary models by interpolating weights☆91Updated last year
- ☆22Updated 3 months ago
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent☆79Updated last month
- Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL).☆67Updated this week
- Official PyTorch Implementation of "Rosetta Neurons: Mining the Common Units in a Model Zoo"☆30Updated last year
- ☆118Updated 7 months ago
- Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models☆76Updated 7 months ago
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)☆33Updated last year
- Code for T-MARS data filtering☆35Updated last year
- ☆37Updated 9 months ago
- [ACL 2024 Findings & ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and inexpensive to use. Specific…☆67Updated 7 months ago
- ☆31Updated 3 months ago
- [ICLR 2025] Video Action Differencing☆36Updated last month
- 🔥 [ICLR 2025] Official Benchmark Toolkits for "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"☆26Updated 2 months ago
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"☆50Updated 11 months ago
- Code and data for the paper: Learning Action and Reasoning-Centric Image Editing from Videos and Simulation☆28Updated 3 months ago
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"☆36Updated last year
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic…☆73Updated 4 months ago
- ☆29Updated 2 years ago
- M4 experiment logbook☆57Updated last year
- Official code and data for NeurIPS 2023 paper "ImageNet-Hard: The Hardest Images Remaining from a Study of the Power of Zoom and Spatial …☆38Updated last year
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆20Updated last year
- Recursive Visual Programming (ECCV 2024)☆17Updated 5 months ago
- ☆51Updated 10 months ago
- ☆36Updated 2 years ago