BingYang-20/DP-RTF-Learning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/BingYang-20/DP-RTF-Learning)

BingYang-20 / DP-RTF-Learning

A python implementation of “Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization” [TASLP 2021]

☆28

Alternatives and similar repositories for DP-RTF-Learning

Users that are interested in DP-RTF-Learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

BingYang-20 / SRP-DNN
View on GitHub
A python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]
☆66Sep 28, 2024Updated last year
Audio-WestlakeU / FN-SSL
View on GitHub
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]
☆159Mar 10, 2026Updated 4 months ago
Audio-WestlakeU / SAR-SSL
View on GitHub
A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…
☆40Oct 11, 2024Updated last year
Audio-WestlakeU / RealMAN
View on GitHub
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI…
☆175Apr 29, 2025Updated last year
ISmallFish / Libri-adhoc40
View on GitHub
A dataset collected from synchronized ad-hoc microphone arrays
☆19Apr 24, 2023Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
Fhrozen / jrm_ssl
View on GitHub
Files for the paper: "Sound Source Localization using Deep Residual Learning"
☆24Nov 13, 2017Updated 8 years ago
LiChenda / Multi-clue-TSE-data
View on GitHub
Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"
☆17May 19, 2023Updated 3 years ago
SouppuoS / CDMA
View on GitHub
Circular Differential Microphone Arrays (CDMA) implemented by python
☆32May 10, 2022Updated 4 years ago
RusselZHANG / Microphone-Array-Generalization-for-Multichannel-Narrowband-Deep-Speech-Enhancement
View on GitHub
This is the microphone array generalization investigation based on previous Narrow Band Deep Filtering methods.
☆38Mar 12, 2024Updated 2 years ago
zjsong / SSPL
View on GitHub
PyTorch code for "Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes" (CVPR, 2022…
☆32Jul 8, 2024Updated 2 years ago
aispeech-lab / SDNet
View on GitHub
Pytorch implemention of SDNet
☆23Jun 1, 2021Updated 5 years ago
GeWu-Lab / Generalizable-Audio-Visual-Segmentation
View on GitHub
Official repository of "Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer", AAAI 2024
☆28Mar 14, 2026Updated 4 months ago
ssi-research / FQSE
View on GitHub
Fully Quantized Neural Networks For Speech Enhancement
☆65Feb 15, 2024Updated 2 years ago
yoonsanghyu / FaSNet-TAC-PyTorch
View on GitHub
Full implementation of "End-to-end microphone permutation and number invariant multi-channel speech separation" (Interspeech 2020)
☆76Sep 14, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Audio-WestlakeU / RCT
View on GitHub
This repo gives the code for the official implementation of RCT.
☆13Jun 28, 2022Updated 4 years ago
idiap / nnsslm
View on GitHub
Neural Network based Sound Source Localization Models
☆51Aug 29, 2023Updated 2 years ago
JonathanDZ / TF-FaSNet
View on GitHub
☆24Feb 28, 2023Updated 3 years ago
Audio-WestlakeU / pytorch_lightning_template_for_beginners
View on GitHub
A pytorch template for beginners based on pytorch_lightning
☆50Feb 1, 2024Updated 2 years ago
introlab / uimvdr
View on GitHub
☆13Oct 11, 2024Updated last year
Audio-WestlakeU / Mel-McNet
View on GitHub
The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]
☆26May 14, 2026Updated 2 months ago
tencent-ailab / FRA-RIR
View on GitHub
☆214Dec 4, 2023Updated 2 years ago
VisualAIKHU / SIRA-SSL
View on GitHub
Official Repository for "Audio-Visual Spatial Integration and Recursive Attention for Robust Sound Source Localization" (ACM MM 2023)
☆18Nov 14, 2023Updated 2 years ago
bingo-todd / GCC-PHAT_DNN_Loc
View on GitHub
DNN based binaural sound localization model, using GCC-PHAT as features
☆22Jun 13, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
myuzhao / BeamformOpt
View on GitHub
传感器阵列波束优化设计与应用部分章节程序
☆32Feb 23, 2019Updated 7 years ago
Le-Xiaohuai-speech / SKIP-DPCRN
View on GitHub
☆51Jun 14, 2022Updated 4 years ago
Audio-WestlakeU / McNet
View on GitHub
The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023
☆130Mar 24, 2023Updated 3 years ago
donghoney0416 / DeepASA
View on GitHub
Official page of "DeepASA: An Object-Oriented Multi-Purpose Network for Auditory Scene Analysis"
☆26Apr 15, 2026Updated 3 months ago
875441459 / Design_DMA
View on GitHub
An implementation of frequency-invariant beamformer
☆14Sep 3, 2021Updated 4 years ago
prerak23 / RoomParamEstim
View on GitHub
This is the code for the WASPAA 2021 paper "Blind Room Parameter Estimation Using Multiple Multichannel Speech Recordings
☆17Nov 9, 2022Updated 3 years ago
Audio-WestlakeU / NBSS
View on GitHub
The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation
☆362Jan 1, 2025Updated last year
FrancoisGrondin / steernet
View on GitHub
☆27May 14, 2020Updated 6 years ago
ChengGuoliang0 / SBSS-NAEC-CTF
View on GitHub
☆16Oct 31, 2022Updated 3 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
changxuding / Residual_Echo_Cancellation
View on GitHub
Various Algorithm for Residual Echo Cancellation
☆32Jul 6, 2023Updated 3 years ago
audiolabs / MonteCarloRIRSimulation
View on GitHub
Room impulse response simulation for various array architectures using Monte-Carlo simulation and quaternions (Python)
☆18Feb 25, 2026Updated 4 months ago
ASLP-lab / Smart-Glass-Challenge
View on GitHub
☆17Jun 16, 2026Updated last month
kamo-naoyuki / pytorch_complex
View on GitHub
A temporal module for PyTorch-ComplexTensor
☆44Jun 28, 2024Updated 2 years ago
AmbiqAI / nnse
View on GitHub
NNSE (Neural Network Speech Enhancement) is a speech-denoiser optimized to run on Ambiq's low power platform
☆44Nov 13, 2025Updated 8 months ago
desh2608 / css
View on GitHub
PyTorch implementation of Continuous Speech Separation
☆12Oct 5, 2022Updated 3 years ago
urgent-challenge / urgent2024_challenge
View on GitHub
Official data preparation scripts for the URGENT 2024 Challenge
☆90May 21, 2025Updated last year