(NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights
☆28Oct 28, 2024Updated last year
Alternatives and similar repositories for clip-beyond-tail
Users that are interested in clip-beyond-tail are comparing it to the libraries listed below
Sorting:
- Official code for the paper "Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?" (ICLR 2024)☆10Aug 26, 2024Updated last year
- Can 3D Vision-Language Models Truly Understand Natural Language?☆20Mar 28, 2024Updated last year
- Python package to download and use the SSB datasets☆11Aug 3, 2023Updated 2 years ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆14Sep 30, 2023Updated 2 years ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- (NeurIPS 2022) Self-Supervised Visual Representation Learning with Semantic Grouping☆97Mar 10, 2025Updated 11 months ago
- Code for ICML 2023 paper "When and How Does Known Class Help Discover Unknown Ones? Provable Understandings Through Spectral Analysis"☆14Jun 24, 2023Updated 2 years ago
- (ECCV 2024) Can OOD Object Detectors Learn from Foundation Models?☆25Dec 7, 2024Updated last year
- ☆29Oct 18, 2022Updated 3 years ago
- This repository houses the code for the paper - "The Neglected of VLMs"☆30Dec 31, 2025Updated 2 months ago
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme…☆19Nov 12, 2024Updated last year
- [NeurIPS 2022] Official implementation of the paper "Rethinking Resolution in the Context of Efficient Video Recognition".☆31Nov 16, 2022Updated 3 years ago
- (CVPR 2025) A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning☆24Mar 11, 2025Updated 11 months ago
- (ICCV 2025) "Principal Components" Enable A New Language of Images☆79Jul 28, 2025Updated 7 months ago
- [NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives"☆46Dec 1, 2024Updated last year
- Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance" [NeurI…☆94Apr 29, 2024Updated last year
- ☆11Aug 7, 2025Updated 6 months ago
- ☆10Jul 5, 2024Updated last year
- Novel Visual Category Discovery with Dual Ranking Statistics and Mutual Knowledge Distillation. Bingchen Zhao and Kai Han. (NeurIPS 2021)☆12Aug 20, 2023Updated 2 years ago
- Phys4DGen: A Physics-Driven Framework for Controllable and Efficient 4D Content Generation from a Single Image☆12May 10, 2025Updated 9 months ago
- ☆19Updated this week
- An official PyTorch implementation for CLIPPR☆30Jul 22, 2023Updated 2 years ago
- Is synthetic data from generative models ready for image recognition?☆187Feb 16, 2023Updated 3 years ago
- 2nd place solution of ECCV 2020 workshop VIPriors Image Classification Challenge, https://arxiv.org/abs/2008.00261☆13Aug 22, 2021Updated 4 years ago
- 计算几何算法模板 | Computational geometry algorithm library assembled by myself, helped our team(Vegetables of Tongji) win silver medal in the IC…☆12Jul 8, 2019Updated 6 years ago
- Benchmarking Multi-Image Understanding in Vision and Language Models☆12Jul 29, 2024Updated last year
- Original code base for On Pretraining Data Diversity for Self-Supervised Learning☆14Dec 30, 2024Updated last year
- ☆24Jul 16, 2025Updated 7 months ago
- Project for SNARE benchmark☆11Jun 5, 2024Updated last year
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?☆15Jun 3, 2025Updated 9 months ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Sep 11, 2025Updated 5 months ago
- ☆13Jan 22, 2025Updated last year
- ☆13May 12, 2025Updated 9 months ago
- ☆13Jul 2, 2025Updated 8 months ago
- ☆11Oct 2, 2024Updated last year
- Distributed Optimization Infra for learning CLIP models☆27Oct 3, 2024Updated last year
- Exploring prompt tuning with pseudolabels for multiple modalities, learning settings, and training strategies.☆51Nov 8, 2024Updated last year
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆62Dec 10, 2024Updated last year
- (ICCV2023) IST-Net: Prior-free Category-level Pose Estimation with Implicit Space Transformation☆119Dec 7, 2023Updated 2 years ago