facebookresearch / soundspaces-challenge
Starter code for SoundSpaces challenge at CVPR 21's Embodied AI workshop
☆12 Updated 2 years ago
Alternatives and similar repositories for soundspaces-challenge
Users who are interested in soundspaces-challenge compare it to the repositories listed below.
- ☆23 Updated 4 years ago
- EgoCom: A Multi-person Multi-modal Egocentric Communications Dataset ☆57 Updated 4 years ago
- A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple task… ☆399 Updated last year
- VisualEchoes Dataset (ECCV 2020) ☆35 Updated 3 years ago
- Unofficial Implementation of Google Deepmind's paper `Objects that Sound` ☆83 Updated 7 years ago
- Code and datasets for 'Move2Hear: Active Audio-Visual Source Separation' (ICCV 2021) ☆15 Updated 2 years ago
- Deep Audio-Visual Embedding network (DAVEnet) implementation in PyTorch ☆65 Updated 6 years ago
- 2.5D visual sound dataset ☆99 Updated 3 years ago
- SAAVN Code release for paper "Sound Adversarial Audio-Visual Navigation,ICLR2022" (In PyTorch) ☆19 Updated 2 years ago
- Official PyTorch implementation of "SPACE: Unsupervised Object-Oriented Scene Representation via Spatial Attention and Decomposition" ☆104 Updated last year
- Differentiable Dynamic Programming ☆70 Updated 4 years ago
- FiLM: Visual Reasoning with a General Conditioning Layer ☆363 Updated 3 years ago
- Official PyTorch implementation of GENESIS and GENESIS-V2 ☆110 Updated 3 years ago
- Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech" ☆27 Updated 3 years ago
- Burgess et al. "MONet: Unsupervised Scene Decomposition and Representation" ☆88 Updated 2 years ago
- Code for the paper Learning the Predictability of the Future (CVPR 2021) ☆168 Updated last year
- An implementation of the MONet model for unsupervised scene decomposition in PyTorch ☆59 Updated 3 years ago
- CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning ☆105 Updated 4 years ago
- Multi-object image datasets with ground-truth segmentation masks and generative factors. ☆272 Updated 3 years ago
- PyTorch implementation of ICLR 2020 paper "CLEVRER: CoLlision Events for Video REpresentation and Reasoning" ☆121 Updated 4 years ago
- Implementation of VQ-VAE for audio ☆41 Updated 7 years ago
- Keras Implementation of "Look, Listen and Learn" Model ☆21 Updated 7 years ago
- Code, data and benchmark from the paper "Unmasking the Inductive Biases of Unsupervised Object Representations for Video Sequences". ☆36 Updated 3 years ago
- Wavenet Autoencoder for Unsupervised speech representation learning (after Chorowski, Jan 2019) ☆175 Updated 4 years ago
- This repository contains the code to reproduce the core results from the paper "Learning Latent Representations for Speech Generation and… ☆52 Updated 7 years ago
- ☆67 Updated 4 years ago
- Cornell Touchdown natural language navigation and spatial reasoning dataset. ☆102 Updated 4 years ago
- Vision and Language Agent Navigation ☆80 Updated 4 years ago
- Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features ☆220 Updated 6 years ago
- A PyTorch implementation of "Continuous Relaxation Training of Discrete Latent Variable Image Models" ☆74 Updated 5 years ago