bingo-todd / WaveLoc
End-to-End binaural sound localization
☆14 · Updated 5 years ago
Alternatives and similar repositories for WaveLoc:
Users who are interested in WaveLoc are comparing it to the repositories listed below.
- A Python implementation of “Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization” [TASLP 2021] ☆24 · Updated 2 years ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w… ☆48 · Updated 5 months ago
- A Track-Wise Ensemble Event Independent Network for 3D Polyphonic Sound Event Localization and Detection ☆20 · Updated 4 months ago
- Data generator for creating synthetic audio mixtures suitable for DCASE Challenge 2022 Task 3 ☆33 · Updated last year
- COG-MHEAR Audio-Visual Speech Enhancement Challenge ☆34 · Updated 11 months ago
- A Diffusion Probabilistic Model for Target Sound Extraction ☆36 · Updated 5 months ago
- A Python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult… ☆34 · Updated 5 months ago
- PyTorch implementation of subband decomposition ☆92 · Updated 2 years ago
- The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024] ☆102 · Updated 3 months ago
- This is the official implementation of LiSenNet ☆64 · Updated 4 months ago
- Baseline method for sound event localization task of DCASE 2022 challenge ☆54 · Updated 2 years ago
- ☆19 · Updated last year
- This is the implementation of the paper "Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement", wh… ☆67 · Updated 2 years ago
- PyTorch implementation of LiMuSE ☆30 · Updated 2 years ago
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge. ☆47 · Updated 2 months ago
- The audio demos with respect to the paper "DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention tra… ☆26 · Updated 2 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement ☆47 · Updated 2 years ago
- ☆55 · Updated last year
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha… ☆33 · Updated 7 months ago
- This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for … ☆67 · Updated 3 years ago
- Official PyTorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023) ☆41 · Updated last year
- ☆44 · Updated last year
- The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023 ☆111 · Updated last year
- Causality Check in Frame-online Speech Separation ☆44 · Updated 2 years ago
- Multi-channel target speech extraction with channel decorrelation and target speaker adaptation ☆25 · Updated 4 years ago
- Query-conditioned target sound extraction model ☆20 · Updated 4 months ago
- ☆64 · Updated last year
- Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation ☆101 · Updated 2 years ago
- wsj0-{2, 3, 4, 5} mix generation scripts, in Python. ☆56 · Updated 4 years ago