facebookresearch / soundspaces-challenge
Starter code for the SoundSpaces challenge at the CVPR 2021 Embodied AI workshop
☆14 Updated 2 years ago
Alternatives and similar repositories for soundspaces-challenge
Users who are interested in soundspaces-challenge are comparing it to the repositories listed below
- ☆23 Updated 5 years ago
- VisualEchoes Dataset (ECCV 2020) ☆36 Updated 4 years ago
- A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple task… ☆423 Updated 2 years ago
- Unofficial Implementation of Google Deepmind's paper `Objects that Sound` ☆83 Updated 7 years ago
- EgoCom: A Multi-person Multi-modal Egocentric Communications Dataset ☆59 Updated 5 years ago
- 2.5D visual sound dataset ☆102 Updated 4 years ago
- Code and datasets for 'Move2Hear: Active Audio-Visual Source Separation' (ICCV 2021) ☆15 Updated 2 years ago
- Deep Audio-Visual Embedding network (DAVEnet) implementation in PyTorch ☆65 Updated 7 years ago
- SAAVN Code release for paper "Sound Adversarial Audio-Visual Navigation,ICLR2022" (In PyTorch) ☆19 Updated 3 years ago
- Burgess et al. "MONet: Unsupervised Scene Decomposition and Representation" ☆89 Updated 3 years ago
- An implementation of the MONet model for unsupervised scene decomposition in PyTorch ☆59 Updated 3 years ago
- Vector Quantized Contrastive Predictive Coding for Template-based Music Generation ☆83 Updated 2 years ago
- Repo for Visual Acoustic Matching, CVPR 2022 ☆70 Updated 2 years ago
- Code for the paper Learning the Predictability of the Future (CVPR 2021) ☆170 Updated 2 years ago
- Official PyTorch implementation of "SPACE: Unsupervised Object-Oriented Scene Representation via Spatial Attention and Decomposition" ☆104 Updated 2 years ago
- Differentiable Dynamic Programming ☆71 Updated 5 years ago
- 2.5D visual sound ☆116 Updated 2 years ago
- Official PyTorch implementation of "Improving Generative Imagination in Object-Centric World Models" ☆37 Updated 3 years ago
- Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech" ☆27 Updated 3 years ago
- Vision and Language Agent Navigation ☆82 Updated 4 years ago
- A simplified PyTorch implementation of GANsynth ☆83 Updated 6 years ago
- PyTorch implementation of FiLM: Visual Reasoning with a General Conditioning Layer ☆64 Updated 6 years ago
- CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning ☆107 Updated 4 years ago
- Implementation of VQ-VAE for audio ☆43 Updated 7 years ago
- Audio propagation engine - Meta Reality Labs Research. ☆21 Updated 3 years ago
- Official PyTorch implementation of GENESIS and GENESIS-V2 ☆108 Updated 3 years ago
- Bongard-LOGO is a Python code repository with the purpose of generating synthetic Bongard problems on a large scale with little human int… ☆53 Updated 3 years ago
- FiLM: Visual Reasoning with a General Conditioning Layer ☆410 Updated 3 years ago
- ReaSCAN is a synthetic navigation task that requires models to reason about surroundings over syntactically difficult languages. (NeurIPS… ☆19 Updated 4 years ago
- Code, data and benchmark from the paper "Unmasking the Inductive Biases of Unsupervised Object Representations for Video Sequences". ☆36 Updated 4 years ago