miguelcollette / audio_clustering
unsupervised clustering of speech / music, or genres of music
☆9Updated 6 years ago
Alternatives and similar repositories for audio_clustering:
Users that are interested in audio_clustering are comparing it to the libraries listed below
- multi-channel target speech extraction with channel decorrelation and target speaker adaptation☆25Updated 4 years ago
- The state-of-art time domain network for speech separation, and it performs well on speech enhancement and music separation☆44Updated 6 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆41Updated 3 years ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆55Updated 6 months ago
- Constrained Permutation Invariant Training, Speech Separation☆47Updated 4 years ago
- A Diffusion Probabilistic Model for Target Sound Extraction☆37Updated 7 months ago
- Python 3.5 and Windows version of Speech Enhancement using DNN by Yong Xu and Qiuqiang Kong☆15Updated 6 years ago
- Implementation for paper: Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement☆21Updated 3 years ago
- ☆31Updated 2 years ago
- A STFT/iSTFT written up in PyTorch using 1D Convolutions☆28Updated 9 months ago
- Dual-Path RNN for Single-Channel Speech Separation (in Keras-Tensorflow2)☆34Updated 4 years ago
- ☆50Updated last year
- End-To-End Speaker Verification based on X-vector and Neural PLDA - A PyTorch implementation☆22Updated 3 years ago
- PyTorch implementation of LiMuSE☆30Updated 2 years ago
- implementation of Monaural Speech Enhancement with Recursive Learning in the Time Domain☆46Updated 4 years ago
- ☆30Updated last year
- ☆34Updated 3 years ago
- This is the implementation of the paper ''Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement'', wh…☆67Updated 2 years ago
- ☆43Updated 2 years ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆57Updated 7 months ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆49Updated 2 years ago
- ☆41Updated 5 years ago
- implementation of "DCCRN-Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement" by pytorch☆51Updated 3 years ago
- The implementation of "A Recursive Network with Dynamic Attention for Monaural Speech Enhancement"☆77Updated 2 years ago
- Clustering-based methods for overlapping diarization☆81Updated last year
- DNN-based SE in the frequency domain using Pytorch. You can test some state-of-the-art networks using T-F masking or spectral mapping met…☆54Updated 3 years ago
- Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications☆44Updated 3 years ago
- Pytorch implementation of Extended U-Net for Speaker Verification in Noisy Environments☆28Updated last year
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆16Updated 2 years ago
- Tensorflow implementation of deep CASA☆65Updated 3 years ago