facebookresearch / novel-view-acoustic-synthesis
Code for Novel View Acoustic Synthesis paper
☆48Updated last year
Alternatives and similar repositories for novel-view-acoustic-synthesis
Users who are interested in novel-view-acoustic-synthesis are comparing it to the repositories listed below
- Download scripts and tools for Replay dataset.☆33Updated 2 years ago
- Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation☆39Updated last year
- AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis☆11Updated 9 months ago
- ☆27Updated 2 years ago
- ☆46Updated last year
- [ICLR 2025] NeRAF jointly learns acoustic and radiance fields, enabling realistic audio-visual generation.☆21Updated 2 months ago
- ☆47Updated 11 months ago
- We present a model that can generate accurate 3D sound fields of human bodies from headset microphones and body pose as inputs.☆87Updated last year
- Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)☆65Updated 4 years ago
- Facestar dataset. High quality audio-visual recordings of human conversational speech.☆109Updated 3 years ago
- An implementation of simple diffusion in PyTorch (and JAX)☆35Updated 2 years ago
- Official code for the CVPR 2024 Paper: Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language☆83Updated last year
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"☆54Updated 2 years ago
- Codebase for the Paper: Learning Visual Styles from Audio-Visual Associations (ECCV 2022, in PyTorch)☆15Updated 2 years ago
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image …☆84Updated last year
- [ICML 2025] Gaussian Mixture Flow Matching Models (GMFlow)☆111Updated last month
- Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis☆39Updated last year
- Repo for Visual Acoustic Matching, CVPR 2022☆68Updated 2 years ago
- Official PyTorch implementation of "Conditional Generation of Audio from Video via Foley Analogies".☆87Updated last year
- Official code of AAAI'23 paper AudioEar: Single-View Ear Reconstruction for Personalized Spatial Audio written in PyTorch☆35Updated last year
- Localize to Binauralize: Audio Spatialization from Visual Sound Source Localization (ICCV 2021)☆10Updated 3 years ago
- An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching"☆42Updated last year
- Long-Term Rhythmic Video Soundtracker, ICML2023☆59Updated last year
- Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos☆22Updated 9 months ago
- Code for "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces" ACM MM 2023☆30Updated last year
- Official repository for the paper Multimodal Transformer Distillation for Audio-Visual Synchronization (ICASSP 2024).☆25Updated last year
- Implementation of NWT, audio-to-video generation, in Pytorch☆91Updated 3 years ago
- Official code for "Learning Neural Acoustic Fields" (NeurIPS 2022)☆142Updated last year
- Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch☆52Updated 7 months ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆87Updated 9 months ago