Surrey-UP-Lab / AV-GSLinks
AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis
☆12Updated last year
Alternatives and similar repositories for AV-GS
Users that are interested in AV-GS are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] NeRAF jointly learns acoustic and radiance fields, enabling realistic audio-visual generation.☆29Updated 5 months ago
- Download scripts and tools for Replay dataset.☆35Updated 2 years ago
- Code for Novel View Acoustic Synthesis paper☆51Updated 2 years ago
- Towards training VQ-VAE models robustly!☆85Updated 3 months ago
- ☆29Updated 2 years ago
- Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation☆40Updated last year
- [Arxiv'25] MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization☆48Updated last month
- ☆26Updated 7 months ago
- ☆46Updated last year
- [NeurIPS 2023] AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis☆31Updated last year
- Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos☆25Updated last year
- Explore how to get a VQ-VAE models efficiently!☆60Updated 3 months ago
- [ICML 2025] Gaussian Mixture Flow Matching Models (GMFlow)☆142Updated 2 weeks ago
- ☆47Updated last year
- [CVPR 2023] iQuery: Instruments as Queries for Audio-Visual Sound Separation☆70Updated 2 years ago
- Code release for PianoMotion10M☆93Updated 7 months ago
- [CVPR 2025] Parallel Sequence Modeling via Generalized Spatial Propagation Network☆106Updated 3 months ago
- Pytorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting☆97Updated 7 months ago
- Hearing Anything Anywhere Code Release☆48Updated last year
- pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation☆136Updated this week
- Official Implementation for StreamFlow: Streamlined Multi-Frame Optical Flow Estimation for Video Sequences, NeurIPS' 24☆35Updated 7 months ago
- HyperGaussians: High-Dimensional Gaussian Splatting for High-Fidelity Animatable Face Avatars☆29Updated 3 weeks ago
- ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer☆38Updated 10 months ago
- Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch☆54Updated 11 months ago
- ☆24Updated 7 months ago
- NeRF Revisited: Fixing Quadrature Instability in Volume Rendering, Neurips 2023☆73Updated last year
- [ICLR'24] GTA: A Geometry-Aware Attention Mechanism for Multi-view Transformers☆147Updated 6 months ago
- [ACM MM24 Poster] Official implementation of paper "MVPbev: Multi-view Perspective Image Generation from BEV with Test-time Controllabili…☆20Updated last month
- An open source Multi-View Latent Diffusion Model☆38Updated 6 months ago
- Code and datasets for 'Few-Shot Audio-Visual Learning of Environment Acoustics' (NeurIPS 2022)☆23Updated last year