facebookresearch / visual-acoustic-matching
Repo for Visual Acoustic Matching, CVPR 2022
☆60Updated last year
Related projects:
- Code for the paper "Learning Audio-Visual Dereverberation"☆25Updated 2 years ago
- PyTorch implementation of the paper "Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training"☆39Updated last year
- The Easy Communications (EasyCom) dataset is a world-first dataset designed to help mitigate the *cocktail party effect* from an augmente…☆103Updated 9 months ago
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆82Updated 2 years ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆56Updated last year
- Code and datasets for 'Few-Shot Audio-Visual Learning of Environment Acoustics' (NeurIPS 2022)☆13Updated 9 months ago
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆46Updated 2 years ago
- Python code for handling the Clotho dataset.☆74Updated 3 years ago
- COLA contrastive pre-training method implemented in PyTorch☆42Updated 3 years ago
- Implementation of "Audio Retrieval with Natural Language Queries: A Benchmark Study".☆45Updated 2 years ago
- Project website for "Telling left from right: Learning spatial correspondence between sight and sound"☆20Updated 2 years ago
- 🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)☆28Updated 3 months ago
- Code for the IEEE Signal Processing Letters 2022 paper "UAVM: Towards Unifying Audio and Visual Models".☆55Updated last year
- Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark☆36Updated 3 weeks ago
- Evaluation script for VoxMovies dataset in PyTorch☆22Updated 8 months ago
- Code and datasets for 'Move2Hear: Active Audio-Visual Source Separation' (ICCV 2021)☆13Updated last year
- VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer☆33Updated last year
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆108Updated last year
- PyTorch Dataset for Speech and Music audio☆73Updated 2 months ago
- Solos: A Dataset for Audio-Visual Music Analysis☆21Updated last year
- Learning differentiable temporal resolution on time-series data.☆33Updated last year
- Web-crawl for "Audio Retrieval with WavText5K and CLAP Training"☆49Updated last year
- Source code for the paper 'Audio Captioning Transformer'☆47Updated 2 years ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen☆61Updated 2 years ago
- Official PyTorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)☆39Updated last year
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆38Updated last year
- Code for the paper "Sound Localization by Self-Supervised Time-Delay Estimation" (ECCV 2022)☆18Updated last year
- Evaluate EfficientAT models on the Holistic Evaluation of Audio Representations Benchmark.☆23Updated last year