Reading notes on speech and deep learning papers, covering Automatic Speech Recognition (ASR), Speech Enhancement and Dereverberation (SED), Speech Separation (SS), Sound Source Localization (SSL), and other audio signal processing topics.
☆27Jun 8, 2023Updated 2 years ago
Alternatives and similar repositories for Paper-Reading-Notes
Users that are interested in Paper-Reading-Notes are comparing it to the libraries listed below.
- ResNet-STFT Model for Sound Source Localization☆20Aug 25, 2022Updated 3 years ago
- [WIP]Direction based Multi-Channel Speech Separation☆14Jan 25, 2024Updated 2 years ago
- Generalized RNN beamformer for speech separation☆18Jan 11, 2022Updated 4 years ago
- An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement☆14Nov 19, 2023Updated 2 years ago
- We design a spectral compression mapping (SCM) for full-band speech enhancement, and propose a two-stage stream named MHA-DPCRN☆24Jul 4, 2022Updated 3 years ago
- Official PyTorch implementation of the Interspeech 2023 paper☆28Jul 5, 2023Updated 2 years ago
- A MATLAB implementation of “Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering” [TASLP 2019]☆11Oct 23, 2023Updated 2 years ago
- This is the official implementation of PGUSE☆37Jun 7, 2025Updated 9 months ago
- A Python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]☆60Sep 28, 2024Updated last year
- Nested U-Net with two-level skip connections for speech enhancement☆36Dec 18, 2023Updated 2 years ago
- ☆24Feb 28, 2023Updated 3 years ago
- A Python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆40Oct 11, 2024Updated last year
- PASE: Phonologically Anchored Speech Enhancer☆44Dec 10, 2025Updated 3 months ago
- The implementation of TaylorBeamformer, which is in submission to Interspeech 2022☆48Jun 10, 2022Updated 3 years ago
- Graph Neural Networks for Sound Source Localization☆26Oct 31, 2023Updated 2 years ago
- ☆21Apr 27, 2024Updated last year
- ☆11Aug 5, 2022Updated 3 years ago
- ☆10Jun 24, 2021Updated 4 years ago
- ☆16Jun 15, 2022Updated 3 years ago
- This is the official implementation of the LiSenNet☆154Nov 15, 2024Updated last year
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"☆16May 19, 2023Updated 2 years ago
- Official code of "DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement, IEEE Signal Processing Letters, 20…☆35Jan 26, 2026Updated last month
- ☆15Feb 1, 2026Updated last month
- An unofficial implementation of DeepVQE proposed by Microsoft Corp.☆130Mar 24, 2025Updated 11 months ago
- This is the microphone array generalization investigation based on previous Narrow Band Deep Filtering methods.☆38Mar 12, 2024Updated 2 years ago
- Implementation of CGMM-MVDR beamforming used for Clarity challenge☆14Jan 14, 2022Updated 4 years ago
- This is an unofficial PyTorch implementation of the DTLN model repository, which contains denoising and inference code for the DTLN model…☆24Jun 18, 2023Updated 2 years ago
- Full implementation of "End-to-end microphone permutation and number invariant multi-channel speech separation" (Interspeech 2020)☆75Sep 14, 2021Updated 4 years ago
- Applying the Wave-U-Net network to speech enhancement☆14May 29, 2020Updated 5 years ago
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)☆256Dec 12, 2025Updated 3 months ago
- Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"☆229Apr 22, 2024Updated last year
- Real-time speech enhancement☆17Jan 23, 2024Updated 2 years ago
- ☆18Oct 26, 2023Updated 2 years ago
- Necessary and Sufficient Conditions for Observability of SLAM-based Microphone Array Calibration and Sound Source Localization☆14Mar 23, 2021Updated 5 years ago
- ☆19Apr 1, 2020Updated 5 years ago
- Preprocessing script for TIMIT data for DNN-AEC work☆37Mar 3, 2022Updated 4 years ago
- We have trained an intelligent agent that draws bounding boxes around an object in the image. This implementation combines CNN, DQN, and …☆10Oct 29, 2025Updated 4 months ago
- Feedforward Sequential Memory Networks☆16Aug 2, 2022Updated 3 years ago
- ☆14Oct 12, 2023Updated 2 years ago