wkvong / multimodal-baby
☆34 · Updated 4 months ago
Alternatives and similar repositories for multimodal-baby
Users who are interested in multimodal-baby are comparing it to the repositories listed below.
- Menagerie of models trained on SAYCam (and more) ☆23 · Updated last year
- ☆20 · Updated 10 months ago
- Pytorch Implementation of paper "Object-Centric Learning with Slot Attention" ☆97 · Updated last year
- An approach to building pure vision foundation models by prompting masked predictors with "counterfactual" visual inputs. ☆28 · Updated last year
- [NeurIPS2022] Mind Reader: Reconstructing complex images from brain activities ☆62 · Updated 2 years ago
- [Algonauts 2023] PyTorch implementation of "Memory Encoding Model" ☆58 · Updated last year
- Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number] ☆53 · Updated 6 months ago
- Code release for ICLR 2023 paper: SlotFormer on object-centric dynamics models ☆112 · Updated last year
- Official Code for Neural Systematic Binder ☆33 · Updated 2 years ago
- Official code for Slot-Transformer for Videos (STEVE) ☆49 · Updated 2 years ago
- This is the official source code for SLATE. We provide the code for the model, the training code, and a dataset loader for the 3D Shapes … ☆86 · Updated 2 years ago
- ☆41 · Updated last year
- Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset ☆453 · Updated this week
- ☆38 · Updated 4 months ago
- ☆39 · Updated 3 years ago
- ☆79 · Updated 2 years ago
- Sparse Linear Concept Embeddings ☆110 · Updated 4 months ago
- This repo implements VQVAE on MNIST as well as on a colored version of MNIST images. It also implements a simple LSTM for generating sample … ☆57 · Updated last year
- A collection of resources and papers on Vector Quantized Variational Autoencoder (VQ-VAE) and its application ☆299 · Updated 6 months ago
- [NeurIPS 23' Oral] Emergence of Shape Bias in Convolutional Neural Networks through Activation Sparsity ☆26 · Updated last year
- ☆170 · Updated 2 years ago
- This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, h… ☆55 · Updated 3 months ago
- Code for the paper "Hyperbolic Image-Text Representations", Desai et al, ICML 2023 ☆176 · Updated last year
- Code for the paper: Rotating Features for Object Discovery ☆53 · Updated 11 months ago
- Library for the training and evaluation of object-centric models (ICML 2022) ☆68 · Updated 2 years ago
- Learning What and Where – Unsupervised Disentangling Location and Identity Tracking ☆21 · Updated last year
- maze datasets for investigating OOD behavior of ML systems ☆51 · Updated this week
- Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023] ☆99 · Updated last year
- Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning ☆158 · Updated 2 years ago
- ☆220 · Updated 2 months ago