FYJNEVERFOLLOWS / Paper-Reading-Notes
Reading notes of speech or deep learning related papers, including Automatic Speech Recognition (ASR), Speech Enhancement and Dereverberation (SED), Speech Separation (SS), Sound Source Localization (SSL) and some other audio signal processing topics.
☆23Updated last year
Alternatives and similar repositories for Paper-Reading-Notes:
Users that are interested in Paper-Reading-Notes are comparing it to the libraries listed below
- ☆56Updated last year
- A python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]☆45Updated 6 months ago
- Official PyTorch implementation of the Interspeech 2023 paper☆24Updated last year
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆36Updated 6 months ago
- This repository contains the audio samples for "D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Ma…☆39Updated last year
- Keep track of good articles on speech processing, mainly on speech enhancement, include speech denoise, speech dereverberation and aec、ag…☆45Updated 9 months ago
- The audio demos with respect to the paper "DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention tra…☆27Updated 2 years ago
- Ablation study of local spectral attention (LSA) for full-band speech enhancement (SE)☆28Updated last year
- DNN-based SE in the frequency domain using Pytorch. You can test some state-of-the-art networks using T-F masking or spectral mapping met…☆53Updated 3 years ago
- Beam-guided TasNet☆50Updated 2 years ago
- A python implementation of “Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization” [TASLP 2021]☆25Updated 2 years ago
- 语音增强领域的相关数据仿真工具和方法汇总--持续更新☆39Updated 9 months ago
- This is the official implementation of the LiSenNet☆81Updated 5 months ago
- A training code template for DNN-based speech enhancement.☆86Updated 3 weeks ago
- Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022)☆59Updated 11 months ago
- This is the microphone array generalization investigation based on previous Narrow Band Deep Filtering methods.☆35Updated last year
- ☆58Updated 3 years ago
- The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]☆106Updated 4 months ago
- ☆49Updated 2 years ago
- Pytorch implementation of DPCRN☆14Updated last year
- VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement☆39Updated 7 months ago
- ☆41Updated 10 months ago
- ☆19Updated last year
- pre-process script for timit data for dnn-aec works☆35Updated 3 years ago
- 语音增强TFCN论文复现☆40Updated 3 years ago
- Cross-Domain Echo Controller☆32Updated 3 years ago
- 网络出处:Interactive Speech and Noise Modeling for Speech Enhancement☆28Updated 3 years ago
- wsj0-{2, 3, 4, 5} mix generation scripts, in Python.☆58Updated 4 years ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆54Updated 6 months ago
- Code repository for the paper Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks☆76Updated 2 years ago