wkvong / multimodal-baby
☆35 · Updated 5 months ago
Alternatives and similar repositories for multimodal-baby
Users who are interested in multimodal-baby are comparing it to the libraries listed below.
- Menagerie of models trained on SAYCam (and more) ☆23 · Updated last year
- [NeurIPS2022] Mind Reader: Reconstructing complex images from brain activities ☆62 · Updated 2 years ago
- ☆20 · Updated 11 months ago
- ☆227 · Updated 3 months ago
- An approach to building pure vision foundation models by prompting masked predictors with "counterfactual" visual inputs. ☆29 · Updated 2 years ago
- ☆38 · Updated 5 months ago
- Sparse Linear Concept Embeddings ☆112 · Updated 5 months ago
- [Algonauts 2023] PyTorch implementation of "Memory Encoding Model" ☆58 · Updated last year
- HT-Step is a large-scale article grounding dataset of temporal step annotations on how-to videos ☆20 · Updated last year
- Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023] ☆99 · Updated last year
- This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, h… ☆56 · Updated 3 months ago
- Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number] ☆53 · Updated 7 months ago
- Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset ☆458 · Updated last week
- The Social-IQ 2.0 Challenge Release for the Artificial Social Intelligence Workshop at ICCV '23 ☆32 · Updated last year
- ☆71 · Updated last year
- [NeurIPS 2023] A faithful benchmark for vision-language compositionality ☆83 · Updated last year
- Official Code for Neural Systematic Binder ☆33 · Updated 2 years ago
- ☆120 · Updated 2 years ago
- maze datasets for investigating OOD behavior of ML systems ☆53 · Updated 2 weeks ago
- ☆41 · Updated last year
- Repo for "Physion: Evaluating Physical Prediction from Vision in Humans and Machines", presented at NeurIPS 2021 (Datasets & Benchmarks t… ☆73 · Updated 2 years ago
- Pytorch Implementation of paper "Object-Centric Learning with Slot Attention" ☆99 · Updated last year
- ☆125 · Updated last year
- Code for "Is CLIP ideal? No. Can we fix it? Yes!" ☆16 · Updated 5 months ago
- Official repository for the paper "Brain-Diffuser: Natural scene reconstruction from fMRI signals using generative latent diffusion" by F… ☆133 · Updated 2 years ago
- The Continual Learning in Multimodality Benchmark ☆67 · Updated 2 years ago
- [ECCV2024, Oral, Best Paper Finalist] This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation… ☆37 · Updated 6 months ago
- [NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models ☆158 · Updated 8 months ago
- ☆11 · Updated 6 months ago
- Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning". ☆58 · Updated last year