Surrey-UP-Lab / AV-GSLinks
AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis
☆12Updated last year
Alternatives and similar repositories for AV-GS
Users that are interested in AV-GS are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] NeRAF jointly learns acoustic and radiance fields, enabling realistic audio-visual generation.☆29Updated 4 months ago
- Download scripts and tools for Replay dataset.☆35Updated 2 years ago
- Code for Novel View Acoustic Synthesis paper☆51Updated 2 years ago
- Towards training VQ-VAE models robustly!☆84Updated 2 months ago
- [NeurIPS 2023] AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis☆30Updated last year
- [Arxiv'25] MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization☆48Updated 3 weeks ago
- ☆29Updated 2 years ago
- Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation☆40Updated last year
- HyperGaussians: High-Dimensional Gaussian Splatting for High-Fidelity Animatable Face Avatars☆29Updated this week
- ☆25Updated 6 months ago
- [CVPR 2025] Parallel Sequence Modeling via Generalized Spatial Propagation Network☆106Updated 2 months ago
- Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos☆24Updated last year
- Pytorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting☆96Updated 6 months ago
- An open source Multi-View Latent Diffusion Model☆38Updated 5 months ago
- NeRF Revisited: Fixing Quadrature Instability in Volume Rendering, Neurips 2023☆73Updated last year
- PyTorch re-implementation for MeanFlow☆99Updated 2 months ago
- ☆47Updated last year
- [ICML 2025] Gaussian Mixture Flow Matching Models (GMFlow)☆138Updated 4 months ago
- Scaling Properties of Diffusion Models For Perceptual Tasks (CVPR 2025)☆43Updated 5 months ago
- [EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation☆23Updated 4 months ago
- RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space☆32Updated 2 weeks ago
- Hearing Anything Anywhere Code Release☆47Updated last year
- Code release for PianoMotion10M☆92Updated 6 months ago
- [NeurIPS 2025] Official code for JAFAR: Jack up Any Feature at Any Resolution☆170Updated 2 weeks ago
- Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch☆54Updated 10 months ago
- [CVPR 2024] 3DFIRES: Few Image 3D REconstruction for Scenes with Hidden Surfaces☆27Updated last year
- [ICLR'24] GTA: A Geometry-Aware Attention Mechanism for Multi-view Transformers☆145Updated 5 months ago
- Explore how to get a VQ-VAE models efficiently!☆57Updated 2 months ago
- [ICCV'25] ScenePainter: Semantically Consistent Perpetual 3D Scene Generation with Concept Relation Alignment☆32Updated last week
- ☆48Updated last year