Reading notes on speech and deep learning papers, covering Automatic Speech Recognition (ASR), Speech Enhancement and Dereverberation (SED), Speech Separation (SS), Sound Source Localization (SSL), and other audio signal processing topics.
☆29 · Jun 8, 2023 · Updated 2 years ago
Alternatives and similar repositories for Paper-Reading-Notes
Users that are interested in Paper-Reading-Notes are comparing it to the libraries listed below.
- ResNet-STFT Model for Sound Source Localization ☆20 · Aug 25, 2022 · Updated 3 years ago
- [WIP] Direction based Multi-Channel Speech Separation ☆14 · Jan 25, 2024 · Updated 2 years ago
- Generalized RNN beamformer for speech separation ☆18 · Jan 11, 2022 · Updated 4 years ago
- An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement ☆14 · Nov 19, 2023 · Updated 2 years ago
- We design a spectral compression mapping (SCM) for full-band speech enhancement, and propose a two-stage stream named MHA-DPCRN ☆24 · Jul 4, 2022 · Updated 3 years ago
- Official PyTorch implementation of the Interspeech 2023 paper ☆29 · Jul 5, 2023 · Updated 2 years ago
- A MATLAB implementation of “Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering” [TASLP 2019] ☆11 · Oct 23, 2023 · Updated 2 years ago
- A Python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022] ☆61 · Sep 28, 2024 · Updated last year
- This is the official implementation of PGUSE ☆40 · Jun 7, 2025 · Updated 10 months ago
- Nested U-Net with two-level skip connections for speech enhancement ☆36 · Dec 18, 2023 · Updated 2 years ago
- ☆24 · Feb 28, 2023 · Updated 3 years ago
- A Python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult… ☆40 · Oct 11, 2024 · Updated last year
- The implementation of TaylorBeamformer, which is in submission to Interspeech 2022 ☆48 · Jun 10, 2022 · Updated 3 years ago
- PASE: Phonologically Anchored Speech Enhancer ☆57 · Apr 9, 2026 · Updated 3 weeks ago
- ☆21 · Apr 27, 2024 · Updated 2 years ago
- ☆11 · Aug 5, 2022 · Updated 3 years ago
- Graph Neural Networks for Sound Source Localization ☆27 · Oct 31, 2023 · Updated 2 years ago
- ☆13 · Jun 24, 2021 · Updated 4 years ago
- ☆16 · Jun 15, 2022 · Updated 3 years ago
- This is the official implementation of the LiSenNet ☆159 · Nov 15, 2024 · Updated last year
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues" ☆17 · May 19, 2023 · Updated 2 years ago
- ☆14 · Sep 19, 2024 · Updated last year
- Official code of "DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement, IEEE Signal Processing Letters, 20… ☆36 · Jan 26, 2026 · Updated 3 months ago
- This is the microphone array generalization investigation based on previous Narrow Band Deep Filtering methods. ☆38 · Mar 12, 2024 · Updated 2 years ago
- An unofficial implementation of DeepVQE proposed by Microsoft Corp. ☆139 · Mar 24, 2025 · Updated last year
- Implementation of CGMM-MVDR beamforming used for Clarity challenge ☆14 · Jan 14, 2022 · Updated 4 years ago
- This is an unofficial PyTorch implementation of the DTLN model repository, which contains denoising and inference code for the DTLN model… ☆24 · Jun 18, 2023 · Updated 2 years ago
- Full implementation of "End-to-end microphone permutation and number invariant multi-channel speech separation" (Interspeech 2020) ☆76 · Sep 14, 2021 · Updated 4 years ago
- ☆16 · Feb 1, 2026 · Updated 3 months ago
- Applying the Wave-U-Net network to speech enhancement ☆14 · May 29, 2020 · Updated 5 years ago
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024) ☆260 · Dec 12, 2025 · Updated 4 months ago
- Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement" ☆230 · Apr 22, 2024 · Updated 2 years ago
- Real-time speech enhancement ☆17 · Jan 23, 2024 · Updated 2 years ago
- ☆18 · Oct 26, 2023 · Updated 2 years ago
- Necessary and Sufficient Conditions for Observability of SLAM-based Microphone Array Calibration and Sound Source Localization ☆14 · Mar 23, 2021 · Updated 5 years ago
- ☆19 · Apr 1, 2020 · Updated 6 years ago
- We have trained an intelligent agent that draws bounding boxes around an object in the image. This implementation combines CNN, DQN, and … ☆10 · Apr 14, 2026 · Updated 2 weeks ago
- Preprocessing script for TIMIT data for DNN-AEC work ☆38 · Mar 3, 2022 · Updated 4 years ago
- Feedforward Sequential Memory Networks ☆17 · Aug 2, 2022 · Updated 3 years ago