serre-lab / CVRLinks
A Benchmark for Efficient and Compositional Visual Reasoning
☆25Updated 2 years ago
Alternatives and similar repositories for CVR
Users that are interested in CVR are comparing it to the libraries listed below
Sorting:
- Official code for `Visual Attention Emerges from Recurrent Sparse Reconstruction' (ICML 2022)☆36Updated 3 years ago
- ☆65Updated 3 years ago
- Patching open-vocabulary models by interpolating weights☆91Updated 2 years ago
- [NeurIPS 2021] Code for Unsupervised Learning of Compositional Energy Concepts☆62Updated 3 years ago
- ☆42Updated last year
- Visual Representation Learning Benchmark for Self-Supervised Models☆35Updated last year
- ☆25Updated 2 years ago
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)☆34Updated 2 years ago
- ☆62Updated 3 years ago
- [ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context …☆22Updated last year
- ☆120Updated 2 years ago
- ☆26Updated 3 years ago
- [NeurIPS 2021 Spotlight] Learning to Compose Visual Relations☆102Updated 2 years ago
- Code for the paper Self-Supervised Learning of Split Invariant Equivariant Representations☆30Updated 2 years ago
- Pytorch Implementation of paper "Object-Centric Learning with Slot Attention"☆103Updated 2 years ago
- [NeurIPS'20] Code for the Paper Compositional Visual Generation and Inference with Energy Based Models☆47Updated 2 years ago
- Multimodal Masked Autoencoders (M3AE): A JAX/Flax Implementation☆103Updated 9 months ago
- Stochastic Optimization for Global Contrastive Learning without Large Mini-batches☆20Updated 2 years ago
- Object-aware Contrastive Learning for Debiased Scene Representation (NeurIPS 2021)☆45Updated 4 years ago
- Code for the ICCV 2023 paper "Benchmarking Low-Shot Robustness to Natural Distribution Shifts"☆11Updated last year
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning☆63Updated 3 years ago
- The Continual Learning in Multimodality Benchmark☆68Updated 2 years ago
- Natural Language Descriptions of Deep Visual Features, ICLR 2022☆65Updated 2 years ago
- ImageNetV2 Pytorch Dataset☆42Updated 2 years ago
- Codebase for the paper titled "Continual learning with local module selection"☆25Updated 4 years ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Updated 2 years ago
- Codebase used in the paper "Foundational Models for Continual Learning: An Empirical Study of Latent Replay".☆30Updated 2 years ago
- https://arxiv.org/abs/2209.15162☆53Updated 2 years ago
- [CogSci'21] Study of human inductive biases in CNNs and Transformers.☆43Updated 4 years ago
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"☆19Updated 3 years ago