Surrey-UP-Lab / AV-GSLinks
AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis
☆11Updated 8 months ago
Alternatives and similar repositories for AV-GS
Users that are interested in AV-GS are comparing it to the libraries listed below
Sorting:
- Code for Novel View Acoustic Synthesis paper☆48Updated last year
- [ICLR 2025] NeRAF jointly learns acoustic and radiance fields, enabling realistic audio-visual generation.☆21Updated last month
- Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation☆39Updated last year
- Download scripts and tools for Replay dataset.☆33Updated 2 years ago
- Towards training VQ-VAE models robustly!☆74Updated 5 months ago
- ☆27Updated 2 years ago
- Hearing Anything Anywhere Code Release☆41Updated last year
- [NeurIPS 2023] AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis☆27Updated last year
- Optimal-State Dynamics Estimation for Physics-based Human Motion Capture from Videos☆14Updated 4 months ago
- AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation☆15Updated 6 months ago
- ☆47Updated 11 months ago
- MaskFlow: Discrete Flows For Flexible and Efficient Long Video Generation☆24Updated 3 months ago
- Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos☆21Updated 8 months ago
- Implementation of Multi-Source Music Generation with Latent Diffusion.☆24Updated 9 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆35Updated last year
- Code and datasets for 'Few-Shot Audio-Visual Learning of Environment Acoustics' (NeurIPS 2022)☆19Updated last year
- ☆38Updated 3 years ago
- Localize to Binauralize: Audio Spatialization from Visual Sound Source Localization (ICCV 2021)☆10Updated 3 years ago
- 3D-LMVIC: Learning-based Multi-View Image Coding with 3D Gaussian Geometric Priors☆10Updated last week
- TriNeRFLet: A Wavelet Based Multiscale Triplane NeRF Representation Code☆24Updated 11 months ago
- ☆46Updated 11 months ago
- Code for paper Background Prompting for Improved Object Depth☆29Updated last year
- Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)☆65Updated 3 years ago
- [CVPR 2024] 3DFIRES: Few Image 3D REconstruction for Scenes with Hidden Surfaces☆26Updated last year
- [Official Implementation] Acoustic Autoregressive Modeling 🔥☆70Updated 10 months ago
- A toolkit for computing Fréchet Inception Distance (FID) & Fréchet Video Distance (FVD) metrics.☆30Updated 3 weeks ago
- ☆48Updated 3 months ago
- NeRF Revisited: Fixing Quadrature Instability in Volume Rendering, Neurips 2023☆75Updated last year
- AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model☆29Updated this week
- The official implementation of V-AURA: Temporally Aligned Audio for Video with Autoregression (ICASSP 2025)☆27Updated 5 months ago