penn-waves-lab / AVR
[Neurips'24 Spotlight] Official code for "Acoustic Volume Rendering for Neural Impulse Response Fields"
☆28Updated last month
Alternatives and similar repositories for AVR:
Users that are interested in AVR are comparing it to the libraries listed below
- Real Acoustic Fields An Audio-Visual Room Acoustics Dataset and Benchmark☆41Updated 5 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆46Updated 5 months ago
- Code for paper Learning Audio-Visual Dereverberation☆26Updated 2 years ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆25Updated 2 months ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆19Updated last year
- An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners"☆30Updated last year
- Hearing Anything Anywhere Code Release☆34Updated 8 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆36Updated last year
- A toolkit for researchers in the multimodal sound separation.☆16Updated last year
- Implementation of Multi-Source Music Generation with Latent Diffusion.☆22Updated 5 months ago
- 🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)☆40Updated last week
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆24Updated 10 months ago
- Code and datasets for 'Move2Hear: Active Audio-Visual Source Separation' (ICCV 2021)☆15Updated 2 years ago
- ☆9Updated 8 months ago
- Audio propagation engine - Meta Reality Labs Research.☆18Updated 2 years ago
- Da - ECHO - RetrievAl - daTasEt☆25Updated 7 months ago
- ☆24Updated last year
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters☆29Updated last year
- ☆47Updated 7 months ago
- Official code of ElasticAST (Interspeech 2024 paper)☆28Updated 6 months ago
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".☆37Updated 5 months ago
- The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"☆32Updated last year
- SRTNet☆24Updated last year
- Official implementation of Self-Remixing☆13Updated last year
- Official Pytorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)☆41Updated last year
- Code and datasets for 'Few-Shot Audio-Visual Learning of Environment Acoustics' (NeurIPS 2022)☆14Updated last year
- Test code disclosure for the research paper "UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion Model", as a supplementa…☆19Updated last year