maswang32 / hearinganythinganywhere
Hearing Anything Anywhere Code Release
☆28Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for hearinganythinganywhere
- Real Acoustic Fields An Audio-Visual Room Acoustics Dataset and Benchmark☆37Updated 2 months ago
- Official code for "Learning Neural Acoustic Fields" (NeurIPS 2022)☆130Updated 10 months ago
- [NeurIPS 2023] AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis☆22Updated 9 months ago
- The official repo for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation☆13Updated last month
- Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation☆35Updated 10 months ago
- ☆22Updated last year
- ☆44Updated 4 months ago
- Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)☆62Updated 3 years ago
- ☆46Updated 4 months ago
- NeRAF jointly learns acoustic and radiance fields, enabling realistic audio-visual generation.☆11Updated 2 weeks ago
- Code for Novel View Acoustic Synthesis paper☆44Updated last year
- Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020)☆68Updated 4 years ago
- Code for the Paper: [ECCV2022] Sound Localization by Self-Supervised Time-Delay Estimation☆19Updated last year
- [Official Implementation] Acoustic Autoregressive Modeling 🔥☆57Updated 2 months ago
- Voice Conversion Experiments for THUHCSI Course : <Digital Processing of Speech Signals>☆8Updated last year
- Repo for Visual Acoustic Matching, CVPR 2022☆65Updated last year
- Code and datasets for 'Move2Hear: Active Audio-Visual Source Separation' (ICCV 2021)☆14Updated last year
- Code and datasets for 'Few-Shot Audio-Visual Learning of Environment Acoustics' (NeurIPS 2022)☆14Updated last year
- Code for paper Learning Audio-Visual Dereverberation☆26Updated 2 years ago
- Official PyTorch implementation of "Conditional Generation of Audio from Video via Foley Analogies".☆77Updated 11 months ago
- ☆35Updated last year
- [CVPR 2023] iQuery: Instruments as Queries for Audio-Visual Sound Separation☆63Updated last year
- N/A☆165Updated 2 years ago
- The Easy Communications (EasyCom) dataset is a world-first dataset designed to help mitigate the *cocktail party effect* from an augmente…☆106Updated 11 months ago
- 🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)☆32Updated last month
- Implementation of Multi-Source Music Generation with Latent Diffusion.☆18Updated 2 months ago
- Efficient synchronization from sparse cues☆28Updated 6 months ago
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline☆66Updated 2 weeks ago
- PAM is a no-reference audio quality metric for audio generation tasks☆49Updated 4 months ago
- Implementation of the paper, T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis, ac…☆26Updated 5 months ago