SAGNIKMJR / few-shot-rirLinks
Code and datasets for 'Few-Shot Audio-Visual Learning of Environment Acoustics' (NeurIPS 2022)
☆19Updated last year
Alternatives and similar repositories for few-shot-rir
Users that are interested in few-shot-rir are comparing it to the libraries listed below
Sorting:
- [Neurips'24 Spotlight] Official code for "Acoustic Volume Rendering for Neural Impulse Response Fields"☆42Updated 7 months ago
- Real Acoustic Fields An Audio-Visual Room Acoustics Dataset and Benchmark☆53Updated last year
- Code for paper Learning Audio-Visual Dereverberation☆30Updated 3 years ago
- [NeurIPS'24 splotlight] Official Repo for AcoustiX used in Acoustic volume rendering for neural impulse response fields.☆31Updated 5 months ago
- Hearing Anything Anywhere Code Release☆45Updated last year
- 🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)☆56Updated 6 months ago
- ☆28Updated 2 years ago
- Repo for Visual Acoustic Matching, CVPR 2022☆68Updated 2 years ago
- Official code for "Learning Neural Acoustic Fields" (NeurIPS 2022)☆149Updated last year
- Official implementation of "ViSAGe: Video-to-Spatial AUdio Generation" (ICLR 2025)☆30Updated 2 months ago
- Repository of the WACV'24 paper "Can CLIP Help Sound Source Localization?"☆32Updated 6 months ago
- ☆42Updated 2 years ago
- Code for the Paper: [ECCV2022] Sound Localization by Self-Supervised Time-Delay Estimation☆22Updated 2 years ago
- The official repo for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation☆47Updated last month
- Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)☆12Updated 2 years ago
- Implementation of the paper, T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis, ac…☆33Updated last year
- Sound field estimation based on physics-constrained neural kernel☆13Updated 2 months ago
- [CVPR 2025] Pytorch implementation of the paper "Hearing Anywhere in Any Environment"☆16Updated this week
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".☆44Updated 11 months ago
- Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation☆40Updated last year
- ☆46Updated last year
- The Easy Communications (EasyCom) dataset is a world-first dataset designed to help mitigate the *cocktail party effect* from an augmente…☆120Updated last year
- [ICLR 2025] NeRAF jointly learns acoustic and radiance fields, enabling realistic audio-visual generation.☆25Updated 3 months ago
- [NeurIPS 2023] AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis☆29Updated last year
- The official implementation of V-AURA: Temporally Aligned Audio for Video with Autoregression (ICASSP 2025)☆28Updated 8 months ago
- code for A Large-scale Dataset for Audio-Language Representation Learning☆14Updated 11 months ago
- Audio propagation engine - Meta Reality Labs Research.☆21Updated 2 years ago
- Baseline method for audio-visual sound event localization and detection task of DCASE 2023 challenge☆53Updated 5 months ago
- Code for LAVSS: Location-Guided Audio-Visual Spatial Audio Separation☆15Updated 6 months ago
- [ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes☆48Updated 2 months ago