liangsusan-git / AV-NeRF
[NeurIPS 2023] AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis
☆22Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for AV-NeRF
- [CVPR 2023] iQuery: Instruments as Queries for Audio-Visual Sound Separation☆63Updated last year
- Hearing Anything Anywhere Code Release☆28Updated 5 months ago
- ☆28Updated last month
- ☆22Updated last year
- [ECCV 2024 Oral] Audio-Synchronized Visual Animation☆37Updated 2 months ago
- Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation☆35Updated 10 months ago
- [ICCV 2023] Online Clustered Codebook☆148Updated 2 months ago
- Code for Novel View Acoustic Synthesis paper☆44Updated last year
- Download scripts and tools for Replay dataset.☆30Updated last year
- A Pytorch Implementation of Finite Scalar Quantization☆88Updated 11 months ago
- Official Codebase of "Localizing Visual Sounds the Easy Way" (ECCV 2022)☆30Updated 2 years ago
- Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020)☆68Updated 4 years ago
- ☆19Updated 8 months ago
- Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)☆62Updated 3 years ago
- Real Acoustic Fields An Audio-Visual Room Acoustics Dataset and Benchmark☆37Updated 2 months ago
- Official code for "Learning Neural Acoustic Fields" (NeurIPS 2022)☆130Updated 10 months ago
- ☆44Updated 4 months ago
- [CVPR 2024] Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners☆128Updated 4 months ago
- ☆31Updated 8 months ago
- "SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow", Yuanzhi Zhu, Xingchao Liu, Qiang Liu☆40Updated 3 weeks ago
- Repository of the WACV'24 paper "Can CLIP Help Sound Source Localization?"☆14Updated 7 months ago
- Cyclic Co-Learning of Sounding Object Visual Grounding and Sound Separation☆25Updated 2 years ago
- The official repo for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation☆13Updated last month
- This is the official implementation for ControlVAR.☆55Updated last month
- Voice Conversion Experiments for THUHCSI Course : <Digital Processing of Speech Signals>☆8Updated last year
- [CVPR 2023] Official implementation of our paper - Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learnin…☆23Updated last year
- Official PyTorch implementation of "Conditional Generation of Audio from Video via Foley Analogies".☆77Updated 11 months ago
- [arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization☆84Updated 5 months ago
- Efficient synchronization from sparse cues☆28Updated 6 months ago
- NeRAF jointly learns acoustic and radiance fields, enabling realistic audio-visual generation.☆11Updated 2 weeks ago