FYJNEVERFOLLOWS/Paper-Reading-Notes

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/FYJNEVERFOLLOWS/Paper-Reading-Notes)

FYJNEVERFOLLOWS / Paper-Reading-Notes

Reading notes of speech or deep learning related papers, including Automatic Speech Recognition (ASR), Speech Enhancement and Dereverberation (SED), Speech Separation (SS), Sound Source Localization (SSL) and some other audio signal processing topics.

☆30

Alternatives and similar repositories for Paper-Reading-Notes

Users that are interested in Paper-Reading-Notes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

FYJNEVERFOLLOWS / ResNet-STFT-SSL
View on GitHub
ResNet-STFT Model for Sound Source Localization
☆20Aug 25, 2022Updated 3 years ago
kooBH / DSS
View on GitHub
[WIP]Direction based Multi-Channel Speech Separation
☆14Jan 25, 2024Updated 2 years ago
yongxuUSTC / grnnbf
View on GitHub
Generalized RNN beamformer for speech separation
☆18Jan 11, 2022Updated 4 years ago
Xiaobin-Rong / lite-rtse
View on GitHub
An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement
☆14Nov 19, 2023Updated 2 years ago
ZhongshuHou / MHA-DPCRN
View on GitHub
We design a spectral compression mapping (SCM) for full-band speech enhancement, and propose a two-stage stream named MHA-DPCRN
☆24Jul 4, 2022Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
FYJNEVERFOLLOWS / LaBNet
View on GitHub
Official PyTorch implementation of the Interspeech 2023 paper
☆29Jul 5, 2023Updated 3 years ago
BingYang-20 / TF-Wise-Spatial-Spectrum-Clustering
View on GitHub
A MATLAB implementation of “Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering” [TASLP 2019]
☆11Oct 23, 2023Updated 2 years ago
ichi131 / Direction-based-BiTSE
View on GitHub
☆15Sep 19, 2024Updated last year
seorim0 / NUNet-TLS
View on GitHub
Nested U-Net with two-level skip connections for speech enhancement
☆38Dec 18, 2023Updated 2 years ago
BingYang-20 / SRP-DNN
View on GitHub
A python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]
☆66Sep 28, 2024Updated last year
JonathanDZ / TF-FaSNet
View on GitHub
☆24Feb 28, 2023Updated 3 years ago
egrinstein / gnn_ssl
View on GitHub
Graph Neural Networks for Sound Source Localization
☆29Oct 31, 2023Updated 2 years ago
Andong-Li-speech / TaylorBeamformer
View on GitHub
The implementation of TaylorBeamformer, which is in submission to Interspeech2022
☆49Jun 10, 2022Updated 4 years ago
Audio-WestlakeU / SAR-SSL
View on GitHub
A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…
☆40Oct 11, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ZhongYang2026 / Sandglasset-A-Light-Multi-Granularity-Self-Attentive-Network-For-Time-Domain-Speech-Separation
View on GitHub
Speech Separation
☆21Mar 7, 2024Updated 2 years ago
jwr1995 / WD-TCN
View on GitHub
☆11Aug 5, 2022Updated 3 years ago
hyyan2k / LiSenNet
View on GitHub
This is the official implementation of the LiSenNet
☆162Nov 15, 2024Updated last year
bear-boy / DPCRN-Pytorch
View on GitHub
Pytorch implementation of DPCRN
☆29Mar 31, 2024Updated 2 years ago
SoulProficiency / speechseparation-Sandglasset
View on GitHub
☆13Jun 24, 2021Updated 5 years ago
Okrio / FSPEN
View on GitHub
☆21Apr 27, 2024Updated 2 years ago
hyyan2k / PGUSE
View on GitHub
This is the official implementation of PGUSE
☆41Jun 7, 2025Updated last year
daobilige-su / obs-mic-array-calib
View on GitHub
Necessary and Sufficient Conditions for Observability of SLAM-based Microphone Array Calibration and Sound Source Localization
☆14Mar 23, 2021Updated 5 years ago
RusselZHANG / Microphone-Array-Generalization-for-Multichannel-Narrowband-Deep-Speech-Enhancement
View on GitHub
This is the microphone array generalization investigation based on previous Narrow Band Deep Filtering methods.
☆38Mar 12, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
AkenoSyuRi / DTLNPytorch
View on GitHub
This is an unofficial Pytorch implementation of the DTLN model repository, which contains denoising and inference code for the DTLN model…
☆23Jun 18, 2023Updated 3 years ago
wzhiyuyu / Wave-U-Net-for-SpeechEnhancement
View on GitHub
把 wave-u-net 网络应用于语音增强领域中
☆14May 29, 2020Updated 6 years ago
LiChenda / Multi-clue-TSE-data
View on GitHub
Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"
☆17May 19, 2023Updated 3 years ago
BUTSpeechFIT / cgmm_mvdr_online
View on GitHub
Implementation of CGMM-MVDR beamforming used for Clarity challenge
☆14Jan 14, 2022Updated 4 years ago
Audio-WestlakeU / Narrowband_DeepFiltering
View on GitHub
☆19Apr 1, 2020Updated 6 years ago
yoonsanghyu / FaSNet-TAC-PyTorch
View on GitHub
Full implementation of "End-to-end microphone permutation and number invariant multi-channel speech separation" (Interspeech 2020)
☆76Sep 14, 2021Updated 4 years ago
Orlllem / seld_wav2vec2
View on GitHub
☆18Feb 1, 2026Updated 5 months ago
Le-Xiaohuai-speech / DPCRN_DNS3
View on GitHub
Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"
☆236Apr 22, 2024Updated 2 years ago
zqlsnr / DPCRN
View on GitHub
real-time speech enhance
☆18Jan 23, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
jmcasebeer / cost_aware_enhancement
View on GitHub
Communication-Cost Aware Microphone Selection For Neural Speech Enhancement with Ad-hoc Microphone Arrays
☆17Nov 20, 2020Updated 5 years ago
YongyuG / dnn_aec_data_process
View on GitHub
pre-process script for timit data for dnn-aec works
☆38Mar 3, 2022Updated 4 years ago
Audio-WestlakeU / FN-SSL
View on GitHub
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]
☆159Mar 10, 2026Updated 4 months ago
yinruiqing / fsmn
View on GitHub
Feedforward Sequential Memory Networks
☆18Aug 2, 2022Updated 3 years ago
RoyChao19477 / SEMamba
View on GitHub
This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)
☆273Dec 12, 2025Updated 7 months ago
Okrio / deepvqe
View on GitHub
☆14Oct 12, 2023Updated 2 years ago
aircarlo / bin2bin-GAN-PLC
View on GitHub
bin2bin, a Time-Frequency Generative Adversarial based method for Audio Packet Loss Concealment
☆17Dec 29, 2023Updated 2 years ago