mugen-org / MUGEN_coinrunLinks
A repository for the updated version of CoinRun used to collect MUGEN, a multimodal video-audio-text dataset. This repo contains scripts to train RL agents to navigate the closed world and collect video data.
☆13Updated 3 years ago
Alternatives and similar repositories for MUGEN_coinrun
Users that are interested in MUGEN_coinrun are comparing it to the libraries listed below
Sorting:
- This is the official source code for SLATE. We provide the code for the model, the training code, and a dataset loader for the 3D Shapes …☆86Updated 2 years ago
- multimodal video-audio-text generation and retrieval between every pair of modalities on the MUGEN dataset. The repo. contains the traini…☆40Updated 2 years ago
- [NeurIPS 2021 Spotlight] Learning to Compose Visual Relations☆101Updated 2 years ago
- DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023)☆142Updated 4 months ago
- ☆42Updated last year
- Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"☆144Updated 3 years ago
- Official PyTorch implementation of "Improving Generative Imagination in Object-Centric World Models"☆37Updated 2 years ago
- ☆97Updated 2 months ago
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)☆33Updated 2 years ago
- Official code for Slot-Transformer for Videos (STEVE)☆50Updated 2 years ago
- An ever-growing playground of notebooks showcasing CLIP's impressive zero-shot capabilities☆174Updated 3 years ago
- ☆73Updated 3 years ago
- ☆39Updated 3 years ago
- ☆120Updated 2 years ago
- ☆46Updated last year
- ☆122Updated 8 months ago
- PyTorch implementation of ICLR 2020 paper "CLEVRER: CoLlision Events for Video REpresentation and Reasoning"☆121Updated 4 years ago
- Official code repository for the EMNLP 2021 paper☆26Updated 3 years ago
- Library for the training and evaluation of object-centric models (ICML 2022)☆70Updated 2 years ago
- Official code for our CVPR 2023 paper: Test of Time: Instilling Video-Language Models with a Sense of Time☆46Updated last year
- PyTorch re-implementation of Multi-Object Representation Learning with Iterative Variational Inference☆59Updated 3 years ago
- A list of papers and other resources on language-guided image editing.☆38Updated 4 years ago
- L-Verse: Bidirectional Generation Between Image and Text☆109Updated 6 months ago
- Official Code for Neural Systematic Binder☆33Updated 2 years ago
- Implementation of the video diffusion model and training scheme presented in the paper, Flexible Diffusion Modeling of Long Videos, in Py…☆85Updated 3 years ago
- Release of ImageNet-Captions☆51Updated 2 years ago
- Code and models of MOCA (Modular Object-Centric Approach) proposed in "Factorizing Perception and Policy for Interactive Instruction Foll…☆38Updated last year
- Code for Look for the Change paper published at CVPR 2022☆36Updated 3 years ago
- VideoCC is a dataset containing (video-URL, caption) pairs for training video-text machine learning models. It is created using an automa…☆78Updated 2 years ago
- Pytorch Implementation of paper "Object-Centric Learning with Slot Attention"☆102Updated 2 years ago