arubique / OCCAMLinks

This is an implementation of the paper "Are We Done with Object-Centric Learning?"

☆11

Alternatives and similar repositories for OCCAM

Users that are interested in OCCAM are comparing it to the libraries listed below

Sorting:

ethanlshen / HierNet
Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…
☆21Updated 2 years ago
jonkahana / CLIPPR
An official PyTorch implementation for CLIPPR
☆29Updated 2 years ago
CVMI-Lab / clip-beyond-tail
(NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights
☆29Updated last year
amitakamath / vl_text_encoders_are_bottlenecks
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11Updated 2 years ago
hammoudhasan / DiversitySSL
Original code base for On Pretraining Data Diversity for Self-Supervised Learning
☆14Updated 10 months ago
tsb0601 / MultiMon
☆25Updated 2 years ago
sirkosophia / DIP
Official implementation of DIP: Unsupervised Dense In-Context Post-training of Visual Representations
☆46Updated 2 months ago
princeton-pli / VLM_S2H
Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?
☆15Updated 5 months ago
ExplainableML / fomo_in_flux
Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]
☆60Updated 11 months ago
brendel-group / clip-ood
Official code for the paper "Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?" (ICLR 2024)
☆10Updated last year
facebookresearch / SIE
Code for the paper Self-Supervised Learning of Split Invariant Equivariant Representations
☆30Updated 2 years ago
locuslab / T-MARS
Code for T-MARS data filtering
☆35Updated 2 years ago
kaist-ami / BEAF
[ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"
☆21Updated 7 months ago
jeykigung / HiCLIP
☆30Updated 2 years ago
AllanYangZhou / generative-invariance-transfer
☆26Updated 3 years ago
bfshi / VARS
Official code for `Visual Attention Emerges from Recurrent Sparse Reconstruction' (ICML 2022)
☆36Updated 3 years ago
ggjy / vision_weak_to_strong
☆37Updated last year
mshukor / eP-ALM
[ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.
☆27Updated 2 years ago
k1rezaei / Text-to-concept
☆35Updated last year
facebookresearch / ViP-MAE
This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision
☆36Updated 2 years ago
ml-jku / semantic-image-text-alignment
☆25Updated 2 years ago
ytaek-oh / vl_compo
☆10Updated last year
ziplab / SN-Netv2
[ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".
☆28Updated last year
james-oldfield / muMoE
[NeurIPS'24] Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization
☆37Updated last year
YannDubs / Mini_Decodable_Information_Bottleneck
Minimum viable code for the Decodable Information Bottleneck paper. Pytorch Implementation.
☆11Updated 5 years ago
BatsResearch / ex2
If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions
☆17Updated last year
EPFL-VILAB / XDEnsembles
Robustness via Cross-Domain Ensembles, ICCV 2021 [Oral]
☆39Updated 4 years ago
alinlab / b2t
Bias-to-Text: Debiasing Unknown Visual Biases through Language Interpretation
☆31Updated 2 years ago
kdariina / CLIP-not-BoW-unimodally
Code for "CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally"
☆16Updated 9 months ago
deeplearning-wisc / NSCL
Code for ICML 2023 paper "When and How Does Known Class Help Discover Unknown Ones? Provable Understandings Through Spectral Analysis"
☆13Updated 2 years ago