miguelcollette / audio_clustering
unsupervised clustering of speech / music, or genres of music
☆9Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for audio_clustering
- Constrained Permutation Invariant Training, Speech Separation☆43Updated 3 years ago
- multi-channel target speech extraction with channel decorrelation and target speaker adaptation☆25Updated 3 years ago
- Download and create a tfreader for the audioset dataset☆16Updated 4 years ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆52Updated last year
- implementation of Monaural Speech Enhancement with Recursive Learning in the Time Domain☆44Updated 4 years ago
- Implementation of "SpEx: Multi-Scale Time Domain Speaker Extraction Network".☆35Updated 4 years ago
- ☆46Updated last year
- The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"☆115Updated 2 years ago
- Tensorflow implementation of deep CASA☆63Updated 3 years ago
- This code is to run the WARP-Q speech quality metric.☆34Updated last month
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆39Updated last year
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…☆53Updated last year
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated 2 years ago
- multi-scale time domain speaker extraction☆60Updated 3 years ago
- DNN and RCED speech enhancement☆19Updated 9 months ago
- Multi-Task Audio Source Separation, Two-Stage Model, Complex Domain.☆89Updated last year
- The implementation of "A Recursive Network with Dynamic Attention for Monaural Speech Enhancement"☆77Updated last year
- ☆43Updated last year
- Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clust…☆43Updated 4 years ago
- Streaming Audiotransformers for online Audio tagging☆41Updated 5 months ago
- The state-of-art time domain network for speech separation, and it performs well on speech enhancement and music separation☆42Updated 5 years ago
- A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech☆15Updated last year
- A non-intrusive objective metric for speech quality and intelligibility for normal hearing listeners and cochlear implant users☆69Updated last year
- Filtering and Noise Adding Tool☆29Updated 2 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆40Updated 3 years ago
- Learning differentiable temporal resolution on time-series data.☆33Updated 2 years ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆48Updated 2 years ago
- Pytorch: Channel-wise subband (CWS) input for better voice and accompaniment separation☆94Updated 3 years ago
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆13Updated 2 months ago
- Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation☆97Updated 2 years ago