Reading notes of speech or deep learning related papers, including Automatic Speech Recognition (ASR), Speech Enhancement and Dereverberation (SED), Speech Separation (SS), Sound Source Localization (SSL) and some other audio signal processing topics.
☆30Jun 8, 2023Updated 2 years ago
Alternatives and similar repositories for Paper-Reading-Notes
Users that are interested in Paper-Reading-Notes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ResNet-STFT Model for Sound Source Localization☆20Aug 25, 2022Updated 3 years ago
- [WIP]Direction based Multi-Channel Speech Separation☆14Jan 25, 2024Updated 2 years ago
- Generalized RNN beamformer for speech separation☆18Jan 11, 2022Updated 4 years ago
- An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement☆14Nov 19, 2023Updated 2 years ago
- We design a spectral compression mapping (SCM) for full-band speech enhancement, and propose a two-stage stream named MHA-DPCRN☆24Jul 4, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Official PyTorch implementation of the Interspeech 2023 paper☆29Jul 5, 2023Updated 2 years ago
- A MATLAB implementation of “Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering” [TASLP 2019]☆11Oct 23, 2023Updated 2 years ago
- A python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]☆63Sep 28, 2024Updated last year
- This is the official implementation of PGUSE☆40Jun 7, 2025Updated 11 months ago
- Nested U-Net with two-level skip connections for speech enhancement☆36Dec 18, 2023Updated 2 years ago
- ☆24Feb 28, 2023Updated 3 years ago
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆40Oct 11, 2024Updated last year
- The implementation of TaylorBeamformer, which is in submission to Interspeech2022☆48Jun 10, 2022Updated 3 years ago
- ☆21Apr 27, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆11Aug 5, 2022Updated 3 years ago
- Graph Neural Networks for Sound Source Localization☆27Oct 31, 2023Updated 2 years ago
- PASE: Phonologically Anchored Speech Enhancer☆59Apr 9, 2026Updated last month
- ☆13Jun 24, 2021Updated 4 years ago
- ☆16Jun 15, 2022Updated 3 years ago
- This is the official implementation of the LiSenNet☆159Nov 15, 2024Updated last year
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"☆17May 19, 2023Updated 3 years ago
- ☆15Sep 19, 2024Updated last year
- Official code of "DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement, IEEE Signal Processing Letters, 20…☆36Jan 26, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This is the microphone array generalization investigation based on previous Narrow Band Deep Filtering methods.☆38Mar 12, 2024Updated 2 years ago
- An unofficial implementation of DeepVQE proposed by Microsoft Corp.☆141Mar 24, 2025Updated last year
- Implementation of CGMM-MVDR beamforming used for Clarity challenge☆14Jan 14, 2022Updated 4 years ago
- This is an unofficial Pytorch implementation of the DTLN model repository, which contains denoising and inference code for the DTLN model…☆24Jun 18, 2023Updated 2 years ago
- Full implementation of "End-to-end microphone permutation and number invariant multi-channel speech separation" (Interspeech 2020)☆76Sep 14, 2021Updated 4 years ago
- ☆17Feb 1, 2026Updated 3 months ago
- 把 wave-u-net 网络应用于语音增强领域中☆14May 29, 2020Updated 5 years ago
- Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"☆231Apr 22, 2024Updated 2 years ago
- real-time speech enhance☆17Jan 23, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)☆262Dec 12, 2025Updated 5 months ago
- ☆19Oct 26, 2023Updated 2 years ago
- Necessary and Sufficient Conditions for Observability of SLAM-based Microphone Array Calibration and Sound Source Localization☆14Mar 23, 2021Updated 5 years ago
- ☆19Apr 1, 2020Updated 6 years ago
- We have trained an intelligent agent that draws bounding boxes around an object in the image. This implementation combines CNN, DQN, and …☆10Apr 14, 2026Updated last month
- pre-process script for timit data for dnn-aec works☆38Mar 3, 2022Updated 4 years ago
- Feedforward Sequential Memory Networks☆17Aug 2, 2022Updated 3 years ago