Reading notes of speech or deep learning related papers, including Automatic Speech Recognition (ASR), Speech Enhancement and Dereverberation (SED), Speech Separation (SS), Sound Source Localization (SSL) and some other audio signal processing topics.
☆26Jun 8, 2023Updated 2 years ago
Alternatives and similar repositories for Paper-Reading-Notes
Users that are interested in Paper-Reading-Notes are comparing it to the libraries listed below
Sorting:
- An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement☆14Nov 19, 2023Updated 2 years ago
- ResNet-STFT Model for Sound Source Localization☆20Aug 25, 2022Updated 3 years ago
- Generalized RNN beamformer for speech separation☆18Jan 11, 2022Updated 4 years ago
- [WIP]Direction based Multi-Channel Speech Separation☆14Jan 25, 2024Updated 2 years ago
- We design a spectral compression mapping (SCM) for full-band speech enhancement, and propose a two-stage stream named MHA-DPCRN☆24Jul 4, 2022Updated 3 years ago
- Official PyTorch implementation of the Interspeech 2023 paper☆28Jul 5, 2023Updated 2 years ago
- Nested U-Net with two-level skip connections for speech enhancement☆36Dec 18, 2023Updated 2 years ago
- A MATLAB implementation of “Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering” [TASLP 2019]☆11Oct 23, 2023Updated 2 years ago
- ☆24Feb 28, 2023Updated 3 years ago
- The implementation of TaylorBeamformer, which is in submission to Interspeech2022☆48Jun 10, 2022Updated 3 years ago
- ☆10Jun 24, 2021Updated 4 years ago
- PASE: Phonologically Anchored Speech Enhancer☆38Dec 10, 2025Updated 2 months ago
- ☆16Jun 15, 2022Updated 3 years ago
- Official code of "DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement, IEEE Signal Processing Letters, 20…☆35Jan 26, 2026Updated last month
- A python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]☆59Sep 28, 2024Updated last year
- Implementation of CGMM-MVDR beamforming used for Clarity challenge☆14Jan 14, 2022Updated 4 years ago
- ☆11Aug 5, 2022Updated 3 years ago
- real-time speech enhance☆17Jan 23, 2024Updated 2 years ago
- ☆14Oct 12, 2023Updated 2 years ago
- Paderwasn is a collection of methods for acoustic signal processing in wireless acoustic sensor networks (WASNs).☆18May 8, 2025Updated 9 months ago
- Feedforward Sequential Memory Networks☆16Aug 2, 2022Updated 3 years ago
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"☆16May 19, 2023Updated 2 years ago
- Communication-Cost Aware Microphone Selection For Neural Speech Enhancement with Ad-hoc Microphone Arrays☆18Nov 20, 2020Updated 5 years ago
- ☆13Feb 1, 2026Updated last month
- Necessary and Sufficient Conditions for Observability of SLAM-based Microphone Array Calibration and Sound Source Localization☆14Mar 23, 2021Updated 4 years ago
- An unofficial implementation of DeepVQE proposed by Microsoft Corp.☆128Mar 24, 2025Updated 11 months ago
- This is the microphone array generalization investigation based on previous Narrow Band Deep Filtering methods.☆38Mar 12, 2024Updated last year
- This is the official implementation of PGUSE☆34Jun 7, 2025Updated 8 months ago
- 把 wave-u-net 网络应用于语音增强领域中☆14May 29, 2020Updated 5 years ago
- ☆20Apr 27, 2024Updated last year
- Full implementation of "End-to-end microphone permutation and number invariant multi-channel speech separation" (Interspeech 2020)☆75Sep 14, 2021Updated 4 years ago
- pre-process script for timit data for dnn-aec works☆36Mar 3, 2022Updated 4 years ago
- ☆18Mar 10, 2023Updated 2 years ago
- Graph Neural Networks for Sound Source Localization☆26Oct 31, 2023Updated 2 years ago
- Speech Separation☆18Mar 7, 2024Updated last year
- Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"☆229Apr 22, 2024Updated last year
- This is the official implementation of the LiSenNet☆149Nov 15, 2024Updated last year
- ☆18Oct 26, 2023Updated 2 years ago
- The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]☆141Feb 5, 2026Updated 3 weeks ago