KranthiKumarR / Localize-to-Binauralize
Localize to Binauralize: Audio Spatialization from Visual Sound Source Localization (ICCV 2021)
☆10 Updated 3 years ago
Alternatives and similar repositories for Localize-to-Binauralize
Users who are interested in Localize-to-Binauralize are comparing it to the repositories listed below.
- ☆38 Updated 4 months ago
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image … ☆85 Updated last year
- ☆46 Updated 2 years ago
- ☆17 Updated last year
- A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models (ICASSP 2024) ☆56 Updated last year
- ☆18 Updated last year
- Implementation of Multi-Source Music Generation with Latent Diffusion. ☆26 Updated 11 months ago
- ☆42 Updated 7 months ago
- [Official Implementation] Acoustic Autoregressive Modeling 🔥 ☆71 Updated last year
- ☆37 Updated 4 months ago
- PyTorch implementation for “V2C: Visual Voice Cloning” ☆32 Updated 2 years ago
- Repository of the WACV'24 paper "Can CLIP Help Sound Source Localization?" ☆32 Updated 6 months ago
- Official implementation of RAVEn (ICLR 2023) and BRAVEn (ICASSP 2024) ☆68 Updated 6 months ago
- Code for paper Learning Audio-Visual Dereverberation ☆30 Updated 3 years ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT ☆40 Updated last year
- The official implementation of V-AURA: Temporally Aligned Audio for Video with Autoregression (ICASSP 2025) ☆28 Updated 7 months ago
- Source code for "Synchformer: Efficient Synchronization from Sparse Cues" (ICASSP 2024) ☆83 Updated 6 months ago
- 🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models) ☆56 Updated 6 months ago
- Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback" ☆29 Updated 5 months ago
- A spoken version of the textual story cloze benchmark ☆18 Updated 2 years ago
- Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023) ☆12 Updated 2 years ago
- [InterSpeech'2024] FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency ☆55 Updated 10 months ago
- ☆42 Updated 2 years ago
- ☆25 Updated last year
- [ACM MM24] Official implementation of paper "From Speaker to Dubber: Movie Dubbing with Prosody and Duration Consistency Learning" ☆28 Updated 3 months ago
- Official repository for the paper Multimodal Transformer Distillation for Audio-Visual Synchronization (ICASSP 2024). ☆25 Updated last year
- ☆23 Updated last year
- Source code for DM-Codec. ☆47 Updated 2 months ago
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024) ☆38 Updated last year
- ☆17 Updated last year