This is an implementation of the paper "Are We Done with Object-Centric Learning?"
☆12Sep 11, 2025Updated 6 months ago
Alternatives and similar repositories for OCCAM
Users that are interested in OCCAM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Jan 22, 2025Updated last year
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆22Nov 8, 2023Updated 2 years ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆13Sep 30, 2023Updated 2 years ago
- ☆12Oct 4, 2023Updated 2 years ago
- [CVPR 2024 Highlight] ImageNet-D☆47Oct 15, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Code for "CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally"☆27Feb 27, 2026Updated 3 weeks ago
- ☆48Jan 17, 2023Updated 3 years ago
- Official codebase for the NeurIPS 2023 paper: Towards Last-layer Retraining for Group Robustness with Fewer Annotations. https://arxiv.or…☆12May 15, 2024Updated last year
- ☆12Jun 12, 2023Updated 2 years ago
- Code for the CCE algorithm proposed in "Towards Compositionality in Concept Learning" at ICML 2024.☆16Jun 2, 2024Updated last year
- PyTorch code and pretrained weights for the UNIC models.☆44Aug 29, 2024Updated last year
- Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning (ICML 2023)☆19Dec 15, 2023Updated 2 years ago
- This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large…☆15Jun 4, 2025Updated 9 months ago
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?☆18Jun 3, 2025Updated 9 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 4 months ago
- Official Code for MIMETIC^2☆13Nov 19, 2024Updated last year
- Source code for the paper "Do Deep Neural Network Solutions form a Star Domain?"☆12May 26, 2024Updated last year
- ☆14Updated this week
- (ICML 2023) Discover and Cure: Concept-aware Mitigation of Spurious Correlation☆45Nov 17, 2025Updated 4 months ago
- Enhanced Unsupervised Object Discoveries through Exhaustive Self-Supervised Transformers☆15Jun 25, 2024Updated last year
- Collaborative retina modelling across datasets and species.☆18Updated this week
- Code implementation of our ICCV 2025 paper: On Large Multimodal Models as Open-World Image Classifiers☆26Dec 4, 2025Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"☆29Apr 27, 2024Updated last year
- PHASE annotations for societal bias in vision-and-language tasks.☆17Jun 18, 2024Updated last year
- ☆41Jan 26, 2026Updated last month
- ☆21Apr 10, 2023Updated 2 years ago
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions☆17Apr 4, 2024Updated last year
- PyTorch code corresponding to my blog series on adversarial examples and (confidence-calibrated) adversarial training.☆67Apr 26, 2023Updated 2 years ago
- ABC: Achieving Better Control of Multimodal Embeddings using VLMs [TMLR2025]☆21Aug 21, 2025Updated 7 months ago
- Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?"☆33Mar 15, 2024Updated 2 years ago
- ☆18May 25, 2018Updated 7 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Code for the paper "Studying Large Language Model Behaviors Under Context-Memory Conflicts With Real Documentss"☆15Oct 8, 2024Updated last year
- [CVPR 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding☆56Apr 7, 2025Updated 11 months ago
- MOCA: Self-supervised Representation Learning by Predicting Masked Online Codebook Assignments☆13Jul 8, 2024Updated last year
- ImageNet-12k subset of ImageNet-21k (fall11)☆22Jun 13, 2023Updated 2 years ago
- SSH tunneling daemon☆21Jan 19, 2025Updated last year
- Sketched linear operations for PyTorch☆101Oct 24, 2025Updated 5 months ago
- ZeroC is a neuro-symbolic method that trained with elementary visual concepts and relations, can zero-shot recognize and acquire more com…☆33May 8, 2023Updated 2 years ago