KranthiKumarR/Localize-to-Binauralize

Localize to Binauralize: Audio Spatialization from Visual Sound Source Localization (ICCV 2021)

☆10

Alternatives and similar repositories for Localize-to-Binauralize

Users who are interested in Localize-to-Binauralize are comparing it to the repositories listed below.

karreny / telling-left-from-right
Project website for "Telling left from right: Learning spatial correspondence between sight and sound"
☆29 · Jun 6, 2022 · Updated 4 years ago
yzyouzhang / Empirical-Channel-CM
Official Implementation of our Interspeech 2021 paper "An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure …
☆19 · Feb 15, 2022 · Updated 4 years ago
uark-cviu / Right2Talk
[ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach
☆20 · Aug 2, 2021 · Updated 4 years ago
NAVER-INTEL-Co-Lab / gaudi-lavcap
☆15 · Jan 24, 2025 · Updated last year
Jungjee / ASVspoof_PA
☆24 · Jun 28, 2019 · Updated 7 years ago
facebookresearch / viewseg
Code for "Recognizing Scenes from Novel Viewpoints"
☆29 · Sep 16, 2022 · Updated 3 years ago
SheldonTsui / PseudoBinaural_CVPR2021
Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)
☆72 · Jul 8, 2021 · Updated 5 years ago
Project-MANAS / projectmanas.in
Project MANAS Official Website
☆13 · Jan 10, 2022 · Updated 4 years ago
pabdzadeh / voice-spoof-detection-system
A voice spoofing detection system, based on paper presented at ICSPIS 2021
☆10 · Feb 11, 2022 · Updated 4 years ago
SheldonTsui / SepStereo_ECCV2020
Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020)
☆72 · Oct 20, 2020 · Updated 5 years ago
apple / ml-nvas3d
☆49 · Jul 20, 2024 · Updated 2 years ago
yuhanghe01 / Sound3DVDet
Code for WACV24 work for multiview acoustic-visual detection
☆13 · Mar 22, 2024 · Updated 2 years ago
yyyanbj / experiment-for-pl0-compiler-expansion
🚀 Hainan University, Principles of Compilers: pl0 language compiler extension
☆11 · Dec 19, 2020 · Updated 5 years ago
MRSAudio / MRSAudio_Main
MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations
☆43 · Oct 15, 2025 · Updated 9 months ago
cogmhear / Intelligibility-Oriented-Audio-Visual-Speech-Enhancement
Towards Intelligibility-Oriented Audio-Visual Speech Enhancement
☆15 · Sep 6, 2024 · Updated last year
pedro-morgado / AVSpatialAlignment
☆31 · Jun 14, 2022 · Updated 4 years ago
revsic / torch-retriever-vc
PyTorch implementation of Retriever: Learning Content-Style Representation
☆12 · Jan 27, 2023 · Updated 3 years ago
etzinis / heterogeneous_separation
Code and data recipes for the paper: Heterogeneous Target Speech Separation
☆44 · Dec 6, 2022 · Updated 3 years ago
cpii-cai / PunCantonese
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆15 · Dec 3, 2024 · Updated last year
cvlab-columbia / paperbot
PaperBot: Learning to Design Real-World Tools Using Paper
☆13 · Mar 15, 2024 · Updated 2 years ago
michaelneri / unsupervised-audio-anomaly-detection
Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …
☆11 · Nov 6, 2024 · Updated last year
xyxingx / LumiNet
[CVPR 2025] LumiNet: Latent Intrinsics Meets Diffusion Models for Indoor Scene Relighting
☆45 · Sep 16, 2025 · Updated 10 months ago
facebookresearch / BinauralSpeechSynthesis
N/A
☆190 · May 19, 2022 · Updated 4 years ago
haoheliu / ontology-aware-audio-tagging
☆14 · Nov 22, 2022 · Updated 3 years ago
andrekassis / Breaking-Security-Critical-Voice-Authentication
Source code for paper "Breaking Security-Critical Voice Authentication".
☆13 · Jul 10, 2023 · Updated 3 years ago
V-Sense / 360AudioVisual
This repository contains materials for the paper: Towards generating ambisonics using audio-visual cue for virtual reality
☆13 · Jul 2, 2019 · Updated 7 years ago
kaist-ami / SoundBrush
☆14 · Dec 8, 2025 · Updated 7 months ago
lwang114 / GraphUnsupASR
☆10 · Apr 17, 2024 · Updated 2 years ago
NadineKroher / PyCante
CANTE: Automatic transcription of flamenco singing.
☆14 · Feb 13, 2018 · Updated 8 years ago
DTaoo / DMC
Code for Deep Multimodal Clustering for Unsupervised Audiovisual Learning (CVPR2019)
☆15 · May 27, 2020 · Updated 6 years ago
Koziev / StressModel
Neural model for prediction of stress position in Russian words
☆13 · Jun 22, 2025 · Updated last year
frozentoad9 / CMST
Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages
☆13 · Oct 12, 2022 · Updated 3 years ago
Joechann0831 / LFZSSR
Light Field Super-Resolution with Zero-Shot Learning, CVPR 2021, Oral.
☆30 · Oct 21, 2021 · Updated 4 years ago
cvlab-columbia / trajectories
Code for the paper "Representing Spatial Trajectories as Distributions"
☆13 · Jan 17, 2023 · Updated 3 years ago
TTS-Research / PEL-TTS
☆14 · Aug 16, 2023 · Updated 2 years ago
Soul-AILab / Soul-AILab.github.io
☆17 · Jun 2, 2026 · Updated last month
zaocan666 / DyViSE
Dynamic vision-guided speaker embedding for audio-visual speaker diarization
☆12 · Jul 5, 2022 · Updated 4 years ago
BiometricVox / DAE_SpeakerID
Denoising autoencoders for speaker identification on MCE 2018 challenge
☆12 · Nov 8, 2018 · Updated 7 years ago
lmaxwell / McHuo
A Chinese singing voice dataset, professional male singer, 105 songs, 132 minutes
☆12 · Oct 19, 2023 · Updated 2 years ago