SheldonTsui/PseudoBinaural_CVPR2021

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SheldonTsui/PseudoBinaural_CVPR2021)

SheldonTsui / PseudoBinaural_CVPR2021

Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)

☆72

Alternatives and similar repositories for PseudoBinaural_CVPR2021

Users that are interested in PseudoBinaural_CVPR2021 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SheldonTsui / SepStereo_ECCV2020
View on GitHub
Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020)
☆72Oct 20, 2020Updated 5 years ago
facebookresearch / 2.5D-Visual-Sound
View on GitHub
2.5D visual sound
☆121Jul 25, 2023Updated 3 years ago
facebookresearch / learning-audio-visual-dereverberation
View on GitHub
Code for paper Learning Audio-Visual Dereverberation
☆32Aug 10, 2022Updated 3 years ago
pedro-morgado / spatialaudiogen
View on GitHub
Spatial Audio Generation
☆117Mar 24, 2023Updated 3 years ago
XingangPan / OCDA-Driving-Example
View on GitHub
Example code for OCDA-Driving
☆15Nov 22, 2020Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
facebookresearch / BinauralSpeechSynthesis
View on GitHub
N/A
☆190May 19, 2022Updated 4 years ago
V-Sense / 360AudioVisual
View on GitHub
This repository contains materials for the paper: Towards generating ambisonics using audio-visual cue for virtual reality
☆13Jul 2, 2019Updated 7 years ago
jin-woo-lee / nfs-binaural
View on GitHub
☆14Aug 13, 2023Updated 2 years ago
facebookresearch / FAIR-Play
View on GitHub
2.5D visual sound dataset
☆108Sep 21, 2021Updated 4 years ago
facebookresearch / visual-acoustic-matching
View on GitHub
Repo for Visual Acoustic Matching, CVPR 2022
☆71Feb 28, 2023Updated 3 years ago
zaocan666 / DyViSE
View on GitHub
Dynamic vision-guided speaker embedding for audio-visual speaker diarization
☆12Jul 5, 2022Updated 4 years ago
YYX666660 / LAVSS
View on GitHub
Code for LAVSS: Location-Guided Audio-Visual Spatial Audio Separation
☆19Feb 25, 2025Updated last year
HS-YN / PanoAVQA
View on GitHub
Official repository of PanoAVQA: Grounded Audio-Visual Question Answering in 360° Videos (ICCV 2021)
☆16Oct 12, 2021Updated 4 years ago
ycxioooong / MovieSynopsisAssociation
View on GitHub
Code for "A Graph-Based Framework to Bridge Movies and Synopses", ICCV2019
☆52Aug 9, 2020Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
KranthiKumarR / Localize-to-Binauralize
View on GitHub
Localize to Binauralize: Audio Spatialization from Visual Sound Source Localization (ICCV 2021)
☆10Oct 11, 2021Updated 4 years ago
pedro-morgado / AVSpatialAlignment
View on GitHub
☆31Jun 14, 2022Updated 4 years ago
krantiparida / beyond-image-to-depth
View on GitHub
☆38Jun 29, 2021Updated 5 years ago
facebookresearch / sound-spaces
View on GitHub
A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple task…
☆468Sep 29, 2023Updated 2 years ago
daemon / pytorch-pcen
View on GitHub
PyTorch reimplementation of per-channel energy normalization for audio.
☆107Mar 29, 2019Updated 7 years ago
apple / ml-nvas3d
View on GitHub
☆49Jul 20, 2024Updated 2 years ago
ku-vai / TPoS
View on GitHub
This repository is for The Power of Sound(TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV2023)
☆25Dec 7, 2023Updated 2 years ago
ws-choi / LASAFT-Net-v2
View on GitHub
A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"
☆33Apr 11, 2022Updated 4 years ago
Yu-Wu / Modaily-Aware-Audio-Visual-Video-Parsing
View on GitHub
Code for CVPR 2021 paper Exploring Heterogeneous Clues for Weakly-Supervised Audio-Visual Video Parsing
☆24Dec 29, 2021Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
facebookresearch / A2B
View on GitHub
A2B Neural Rendering of Ambisonic Recordings to Binaural
☆20Aug 5, 2025Updated 11 months ago
tym002 / Hyper-Convolution
View on GitHub
Hyper-Convolution Networks for Biomedical Image Segmentation
☆30Mar 24, 2023Updated 3 years ago
limbo0000 / InstanceLoc
View on GitHub
[CVPR 2021] Instance Localization for Self-supervised Detection Pretraining
☆145Jun 8, 2021Updated 5 years ago
IFICL / stereocrw
View on GitHub
Code for the Paper: [ECCV2022] Sound Localization by Self-Supervised Time-Delay Estimation
☆28Mar 15, 2023Updated 3 years ago
krantiparida / awesome-audio-visual
View on GitHub
A curated list of different papers and datasets in various areas of audio-visual processing
☆775Jan 30, 2024Updated 2 years ago
IFICL / SLfM
View on GitHub
Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation
☆43Jul 16, 2026Updated 2 weeks ago
SAKi-77 / DiffStereo
View on GitHub
DiffStereo: End-to-End Mono-to-Stereo Audio Generation with Diffusion Transformer
☆17Apr 17, 2026Updated 3 months ago
rethinking-3d-gans / code
View on GitHub
Source code for "Rethinking training of 3D GANs"
☆31May 26, 2022Updated 4 years ago
kamo-naoyuki / pytorch_complex
View on GitHub
A temporal module for PyTorch-ComplexTensor
☆44Jun 28, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
uark-cviu / Right2Talk
View on GitHub
[ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach
☆20Aug 2, 2021Updated 4 years ago
Hangz-nju-cuhk / Vision-Infused-Audio-Inpainter-VIAI
View on GitHub
Code for Vision-Infused Deep Audio Inpainting (ICCV 2019)
☆58Oct 25, 2019Updated 6 years ago
thomasdeppisch / eMagLS
View on GitHub
The End-to-End Magnitude Least Squares Binaural Renderer for Spherical Microphone Array Signals
☆41Feb 17, 2026Updated 5 months ago
SheldonTsui / GOF_NeurIPS2021
View on GitHub
The codebase for our paper "Generative Occupancy Fields for 3D Surface-Aware Image Synthesis" (NeurIPS 2021)
☆103Apr 18, 2022Updated 4 years ago
bingo-todd / WaveLoc
View on GitHub
End-to-End binaural sound localization
☆17Feb 27, 2020Updated 6 years ago
facebookresearch / real-acoustic-fields
View on GitHub
Real Acoustic Fields An Audio-Visual Room Acoustics Dataset and Benchmark
☆64Aug 29, 2024Updated last year
KawhiZhao / Egocentric-Audio-Visual-Speaker-Localization
View on GitHub
Code for paper Audio Visual Speaker Localization from EgoCentric Views
☆11Jul 3, 2024Updated 2 years ago