liangsusan-git/AV-NeRF

[NeurIPS 2023] AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis

☆36

Alternatives and similar repositories for AV-NeRF

Users who are interested in AV-NeRF are comparing it to the repositories listed below.

maswang32 / hearinganythinganywhere
Hearing Anything Anywhere Code Release
☆52 · Nov 11, 2025 · Updated 8 months ago
facebookresearch / real-acoustic-fields
Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark
☆64 · Aug 29, 2024 · Updated last year
AmandineBtto / NeRAF
[ICLR 2025] NeRAF jointly learns acoustic and radiance fields, enabling realistic audio-visual generation.
☆37 · Mar 11, 2026 · Updated 4 months ago
jaeyeonkim99 / visage
Official implementation of "ViSAGe: Video-to-Spatial Audio Generation" (ICLR 2025)
☆47 · Sep 10, 2025 · Updated 10 months ago
aluo-x / Learning_Neural_Acoustic_Fields
Official code for "Learning Neural Acoustic Fields" (NeurIPS 2022)
☆167 · Jan 20, 2024 · Updated 2 years ago
facebookresearch / soundvista
soundvista
☆16 · Dec 31, 2025 · Updated 6 months ago
Surrey-UP-Lab / AV-GS
AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis
☆14 · Oct 3, 2024 · Updated last year
WikiChao / FreSca
[CVPR 2025 GMCV] Test-Time Frequency Scaling: Instant Frequency Control for Any Diffusion Model
☆55 · May 31, 2025 · Updated last year
WikiChao / ScalingConcept
☆24 · Nov 1, 2024 · Updated last year
jing-bi / awesome-M.LLM-reasoning
☆20 · May 11, 2025 · Updated last year
WikiChao / DAVIS
[🏆 IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official pytorch implementation of the paper "High-Quality Visually-Guided Sound …
☆33 · Mar 30, 2026 · Updated 3 months ago
SAGNIKMJR / few-shot-rir
Code and datasets for 'Few-Shot Audio-Visual Learning of Environment Acoustics' (NeurIPS 2022)
☆24 · Jun 16, 2026 · Updated last month
lzhangbj / ASVA
[ECCV 2024 Oral] Audio-Synchronized Visual Animation
☆60 · Mar 15, 2026 · Updated 4 months ago
facebookresearch / AcousticRooms
Open repository of simulated Room Impulse Responses (RIR) accompanying the paper "Hearing Anywhere in Any Environment"
☆81 · Aug 11, 2025 · Updated 11 months ago
facebookresearch / replay_dataset
Download scripts and tools for Replay dataset.
☆39 · Jun 23, 2023 · Updated 3 years ago
ahogg / HRTF-upsampling-with-a-generative-adversarial-network-using-a-gnomonic-equiangular-projection
☆12 · Nov 1, 2024 · Updated last year
DragonLiu1995 / xRIR_code
[CVPR 2025] Pytorch implementation of the paper "Hearing Anywhere in Any Environment"
☆33 · Sep 18, 2025 · Updated 10 months ago
arunbalajeev / binaural-sound-perception
Semantic Object Prediction and Spatial Sound Super-Resolution with Binaural Sounds
☆20 · Dec 18, 2021 · Updated 4 years ago
yongyizang / GSound-SIR
A Python Room Spatial Impulse Response Ray-Tracing Toolkit
☆86 · Mar 4, 2026 · Updated 4 months ago
zhiwei-zzz / MoScale
[CVPR 2026] Next-Scale Autoregressive Models for Text-to-Motion Generation
☆16 · Jun 14, 2026 · Updated last month
DTaoo / DMC
Code for Deep Multimodal Clustering for Unsupervised Audiovisual Learning (CVPR 2019)
☆15 · May 27, 2020 · Updated 6 years ago
whzikaros / g2pL
The implementation of g2pL with a new open dataset.
☆16 · May 14, 2023 · Updated 3 years ago
Red-Fairy / argus-code
[ICCV 2025] Official repository of the paper "Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos"
☆45 · Feb 2, 2026 · Updated 5 months ago
pedro-morgado / spatialaudiogen
Spatial Audio Generation
☆117 · Mar 24, 2023 · Updated 3 years ago
yunlong10 / CAT-V
[AAAI 26 Demo] Official repo for CAT-V - Caption Anything in Video: Object-centric Dense Video Captioning with Spatiotemporal Multimodal P…
☆67 · Jan 27, 2026 · Updated 5 months ago
jianzongwu / Does-Hearing-Help-Seeing
☆19 · Dec 3, 2025 · Updated 7 months ago
mickey1356 / acoustic_reliefs
Create acoustic diffusers with custom images!
☆20 · Jan 13, 2026 · Updated 6 months ago
facebookresearch / learning-audio-visual-dereverberation
Code for the paper "Learning Audio-Visual Dereverberation"
☆32 · Aug 10, 2022 · Updated 3 years ago
deeplsd / Merkel-Podcast-Corpus
This dataset is presented in the paper Merkel Podcast Corpus: A Multimodal Dataset Compiled from 16 Years of Angela Merkel's Weekly Video…
☆12 · Sep 21, 2022 · Updated 3 years ago
ArrayDPS / ArrayDPS
☆40 · May 12, 2025 · Updated last year
penn-waves-lab / AcoustiX
[NeurIPS'24 spotlight] Official repo for AcoustiX, used in "Acoustic Volume Rendering for Neural Impulse Response Fields".
☆37 · Dec 15, 2025 · Updated 7 months ago
WikiChao / Ego-AV-Loc
[CVPR 2023] Egocentric Audio-Visual Object Localization
☆27 · Jan 6, 2024 · Updated 2 years ago
Wxyxixixi / DoubleDiffusion_3D_Mesh
Diffusion generation on Mesh toolbox
☆25 · Feb 10, 2025 · Updated last year
shlizee / savvy
Repository for SAVVY (Spatial Awareness via Audio-Visual LLMs through Seeing and Hearing) Benchmark and SAVVY model
☆25 · May 30, 2026 · Updated last month
facebookresearch / SoundingBodies
We present a model that can generate accurate 3D sound fields of human bodies from headset microphones and body pose as inputs.
☆93 · May 29, 2024 · Updated 2 years ago
ZehuaKcrissLi / GTR-Voice
☆16 · Nov 11, 2024 · Updated last year
AudibleLight / AudibleLight
A controllable, end-to-end API for soundscape synthesis across ray-traced & real-world measured acoustics
☆27 · Apr 1, 2026 · Updated 3 months ago
PeiwenSun2000 / Both-Ears-Wide-Open
The official repo for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation
☆65 · Jul 2, 2025 · Updated last year
samuel-clarke / RealImpact
☆34 · Apr 10, 2023 · Updated 3 years ago