wkvong / multimodal-baby
☆30Updated 6 months ago
Alternatives and similar repositories for multimodal-baby:
Users that are interested in multimodal-baby are comparing it to the libraries listed below
- Menagerie of models trained on SAYCam (and more)☆21Updated 9 months ago
- ☆38Updated 2 years ago
- Library for the training and evaluation of object-centric models (ICML 2022)☆65Updated last year
- Code release for ICLR 2023 paper: SlotFormer on object-centric dynamics models☆104Updated last year
- ☆40Updated 11 months ago
- Repo for "Physion: Evaluating Physical Prediction from Vision in Humans and Machines", presented at NeurIPS 2021 (Datasets & Benchmarks t…☆61Updated last year
- Official Code for Neural Systematic Binder☆30Updated last year
- An approach to building pure vision foundation models by prompting masked predictors with "counterfactual" visual inputs.☆25Updated last year
- Official code for Slot-Transformer for Videos (STEVE)☆49Updated 2 years ago
- This is the official source code for SLATE. We provide the code for the model, the training code, and a dataset loader for the 3D Shapes …☆84Updated 2 years ago
- Natural Language Descriptions of Deep Visual Features, ICLR 2022☆62Updated last year
- ☆24Updated 8 months ago
- [NeurIPS2022] Mind Reader: Reconstructing complex images from brain activities☆61Updated 2 years ago
- Pytorch Implementation of paper "Object-Centric Learning with Slot Attention"☆86Updated last year
- ☆18Updated 4 months ago
- [NeurIPS 2021 Spotlight] Learning to Compose Visual Relations☆101Updated last year
- ☆34Updated 4 months ago
- ☆71Updated last year
- [CVPR 2023] Learning Visual Representations via Language-Guided Sampling☆146Updated last year
- The Continual Learning in Multimodality Benchmark☆63Updated last year
- Ying Nian Wu's UCLA Statistical Machine Learning Tutorial on generative modeling.☆54Updated 2 years ago
- ☆10Updated 2 years ago
- ElasticTok: Adaptive Tokenization for Image and Video☆46Updated 2 months ago
- [NeurIPS 2023] Learning Energy-Based Prior Model with Diffusion-Amortized MCMC☆13Updated last year
- Sparse Linear Concept Embeddings☆80Updated 5 months ago
- Visual Language Transformer Interpreter - An interactive visualization tool for interpreting vision-language transformers☆86Updated last year
- Code for the paper "Contrastive Learning Inverts the Data Generating Process".☆89Updated 5 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆81Updated last year
- ☆72Updated 2 years ago
- ☆27Updated 7 months ago