miguelcollette / audio_clusteringLinks
unsupervised clustering of speech / music, or genres of music
☆9Updated 6 years ago
Alternatives and similar repositories for audio_clustering
Users that are interested in audio_clustering are comparing it to the libraries listed below
Sorting:
- Implementing the paper -☆19Updated last year
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated 2 years ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆58Updated 7 months ago
- multi-channel target speech extraction with channel decorrelation and target speaker adaptation☆25Updated 4 years ago
- ☆16Updated 4 years ago
- The purpose of this code base is to add a specified signal-to-noise ratio noise from MUSAN dataset to a pure speech signal and to generat…☆29Updated 3 years ago
- ☆25Updated 7 months ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆49Updated 6 years ago
- ☆52Updated last year
- The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"☆121Updated 2 years ago
- Official repo for "A MODULATION-DOMAIN LOSS FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT" to appear in ICASSP 2021☆39Updated 3 years ago
- ☆14Updated 2 years ago
- Multi-Task Audio Source Separation, Two-Stage Model, Complex Domain.☆93Updated last year
- Single Channel Speech Enhancement Methods and Toolbox☆30Updated 3 months ago
- Clustering-based methods for overlapping diarization☆81Updated last year
- ☆14Updated 3 years ago
- The implementation of "A Recursive Network with Dynamic Attention for Monaural Speech Enhancement"☆77Updated 2 years ago
- Constrained Permutation Invariant Training, Speech Separation☆47Updated 4 years ago
- Tensorflow implementation of deep CASA☆65Updated 4 years ago
- Streaming Audiotransformers for online Audio tagging☆44Updated 11 months ago
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆16Updated 2 years ago
- A pytorch implementation of D3Net.☆11Updated 3 years ago
- ☆59Updated 4 years ago
- ☆81Updated 11 months ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆49Updated 2 years ago
- A STFT/iSTFT written up in PyTorch using 1D Convolutions☆28Updated 10 months ago
- Download and create a tfreader for the audioset dataset☆16Updated 5 years ago
- ☆30Updated last year
- ☆65Updated last year
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆53Updated 2 weeks ago