FYJNEVERFOLLOWS/Awesome-Sound-Source-Localization

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/FYJNEVERFOLLOWS/Awesome-Sound-Source-Localization)

FYJNEVERFOLLOWS / Awesome-Sound-Source-Localization

A tutorial for Sound Source Localization researchers and practitioners. The purpose of this repo is to organize the world’s resources for Sound Source Localization, and make them universally accessible and useful.

☆59

Alternatives and similar repositories for Awesome-Sound-Source-Localization

Users that are interested in Awesome-Sound-Source-Localization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

idiap / nnsslm
View on GitHub
Neural Network based Sound Source Localization Models
☆51Aug 29, 2023Updated 2 years ago
egrinstein / neural_srp
View on GitHub
The Neural-SRP method for DOA estimation
☆37May 24, 2024Updated 2 years ago
Audio-WestlakeU / FN-SSL
View on GitHub
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]
☆159Mar 10, 2026Updated 4 months ago
BrownsugarZeer / Multi_SSL
View on GitHub
Combine sound source separation with SRP-PHAT to achieve multi-source localization.
☆97Jan 22, 2025Updated last year
BingYang-20 / SRP-DNN
View on GitHub
A python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]
☆67Sep 28, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
axeber01 / wav2pos
View on GitHub
3D Sound Source Localization using Masked Autoencoders
☆21Feb 12, 2025Updated last year
egrinstein / gnn_ssl
View on GitHub
Graph Neural Networks for Sound Source Localization
☆29Oct 31, 2023Updated 2 years ago
FYJNEVERFOLLOWS / ResNet-STFT-SSL
View on GitHub
ResNet-STFT Model for Sound Source Localization
☆20Aug 25, 2022Updated 3 years ago
Devin-Pi / uncertainty-estimation-for-ssl
View on GitHub
This repo is for the paper "Uncertainty Estimation for Sound Source Localization".
☆15Mar 13, 2025Updated last year
tdietzen / INST-PSD
View on GitHub
Instantaneous PSD estimation for speech enhancement based on generalized principal components.
☆11Jul 1, 2020Updated 6 years ago
DavidDiazGuerra / icoDOA
View on GitHub
Code repository for the paper Direction of Arrival Estimation of Sound Sources Using Icosahedral CNNs
☆49May 19, 2022Updated 4 years ago
RusselZHANG / Microphone-Array-Generalization-for-Multichannel-Narrowband-Deep-Speech-Enhancement
View on GitHub
This is the microphone array generalization investigation based on previous Narrow Band Deep Filtering methods.
☆38Mar 12, 2024Updated 2 years ago
catherine-qian / cocosda-SSL
View on GitHub
pytorch code for sound event localization and classification
☆13Aug 12, 2021Updated 4 years ago
Audio-WestlakeU / RealMAN
View on GitHub
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI…
☆175Apr 29, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
vlsi-nanocomputing / dynamic-sound
View on GitHub
DynamicSound Simulator is a modular Python library for generating virtual acoustic scenes with configurable microphones, sound sources, a…
☆18Jul 15, 2026Updated last week
jingkangqi / DSENet
View on GitHub
☆34Feb 19, 2025Updated last year
vipchengrui / MASG
View on GitHub
microphone array speech generator (MASG) in room acoustic
☆39Jan 2, 2020Updated 6 years ago
DavidDiazGuerra / Cross3D
View on GitHub
Code repository for the paper Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks
☆90Mar 24, 2023Updated 3 years ago
alexandergwm / Classical-Sound-Source-Localization-Algorithms-in-Spherical-Domain
View on GitHub
Here is a repository stored the classical sound source localization algorithms in spherical domain, namely, PWD, DAS, SHMUSIC, SHMVDR, S…
☆23Nov 16, 2023Updated 2 years ago
axeber01 / ngcc-seld
View on GitHub
Sound Event Localization and Detection using Neural Generalized Cross-Correlations
☆36Feb 11, 2025Updated last year
BingYang-20 / DP-RTF-Learning
View on GitHub
A python implementation of “Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization” [TASLP 2021]
☆28Feb 11, 2023Updated 3 years ago
inverse-ai / FINALLY-Speech-Enhancement
View on GitHub
FINALLY: Fast and universal speech enhancement model delivering studio-quality audio for a wide range of recordings.
☆28Apr 1, 2026Updated 3 months ago
FYJNEVERFOLLOWS / LaBNet
View on GitHub
Official PyTorch implementation of the Interspeech 2023 paper
☆29Jul 5, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ffxiong / stsubnet
View on GitHub
☆22Oct 17, 2024Updated last year
Xiaobin-Rong / lite-rtse
View on GitHub
An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement
☆14Nov 19, 2023Updated 2 years ago
MiguelBlancoGalindo / MicArrayBeamforming
View on GitHub
Microphone Array Beamforming Toolbox
☆96Jun 27, 2020Updated 6 years ago
SoulProficiency / speechseparation-Sandglasset
View on GitHub
☆13Jun 24, 2021Updated 5 years ago
AdiCohen501 / ExNet-BF-PF
View on GitHub
☆15Jul 23, 2024Updated 2 years ago
seanwood / aspp
View on GitHub
ASPP: Binaural Speech Enhancement with Atomic Speech Presence Probability Estimation
☆20Jan 13, 2019Updated 7 years ago
ISmallFish / Libri-adhoc40
View on GitHub
A dataset collected from synchronized ad-hoc microphone arrays
☆19Apr 24, 2023Updated 3 years ago
anton-jeran / FAST-RIR
View on GitHub
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating r…
☆182Mar 19, 2026Updated 4 months ago
aleXiehta / AD-FlowTSE
View on GitHub
Adaptive Flow-Matching for Target Speaker Extraction
☆39Jul 13, 2026Updated last week
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Le-Xiaohuai-speech / GMM_VAD
View on GitHub
☆17Apr 3, 2022Updated 4 years ago
tencent-ailab / FRA-RIR
View on GitHub
☆214Dec 4, 2023Updated 2 years ago
audiolabs / SC-Wind-Noise-Generator
View on GitHub
Generate synthetic wind noise signals based on a wind speed profile (Python)
☆52Apr 23, 2024Updated 2 years ago
JSerwatka / Acoustic-Source-Localization-System
View on GitHub
BSc Thesis: Acoustic source localization embedded system to estimate direction of sound arrival using time difference between sound wave …
☆37May 8, 2021Updated 5 years ago
taotaowang97479 / MFNet-SpeechEnhancement
View on GitHub
This is the unofficial implementation of MFNet, from paper''a Mask Free Neural Network for Monaural Speech Enhancement''
☆13Dec 20, 2024Updated last year
BASHLab / OWL
View on GitHub
☆15May 25, 2026Updated last month
Audio-WestlakeU / SAR-SSL
View on GitHub
A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…
☆40Oct 11, 2024Updated last year