vishalshar / SpeakerDiarization_RNN_CNN_LSTM
Speaker diarization is the problem of separating the speakers in an audio recording. There can be any number of speakers, and the final result should state when each speaker starts and stops talking. In this project, we analyze a given audio file with 2 channels and 2 speakers (on separate channels).
☆64Updated 5 years ago
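To make the expected output concrete, here is a minimal sketch of the two-channel setting described above, assuming each speaker is isolated on their own channel. It uses a simple frame-energy threshold, not the RNN/CNN/LSTM models this repository is named for, and the function names, `frame_ms`, and `threshold` are hypothetical choices for illustration only.

```python
import numpy as np

def channel_segments(signal, sample_rate, frame_ms=20, threshold=0.01):
    """Return (start_sec, end_sec) spans where the frame RMS exceeds threshold."""
    frame_len = int(sample_rate * frame_ms / 1000)
    n_frames = len(signal) // frame_len
    # Drop the trailing partial frame and split the rest into fixed-size frames.
    frames = np.asarray(signal[: n_frames * frame_len], dtype=float)
    frames = frames.reshape(n_frames, frame_len)
    active = np.sqrt((frames ** 2).mean(axis=1)) > threshold
    # Pad with False so every active run has a rising and a falling edge.
    padded = np.concatenate(([False], active, [False]))
    edges = np.flatnonzero(padded[1:] != padded[:-1])
    starts, ends = edges[0::2], edges[1::2]
    return [
        (float(s * frame_len / sample_rate), float(e * frame_len / sample_rate))
        for s, e in zip(starts, ends)
    ]

def diarize_two_channel(stereo, sample_rate, **kwargs):
    """stereo: array of shape (n_samples, 2), assumed one speaker per channel."""
    return {
        f"speaker_{ch}": channel_segments(stereo[:, ch], sample_rate, **kwargs)
        for ch in range(2)
    }

# Synthetic example: speaker 0 talks from 0.1 s to 0.3 s, speaker 1 from 0.5 s to 0.8 s.
sample_rate = 1000
stereo = np.zeros((1000, 2))
stereo[100:300, 0] = 0.5
stereo[500:800, 1] = 0.5
print(diarize_two_channel(stereo, sample_rate))
```

The result maps each speaker to a list of (start, end) times in seconds, which is the form of answer the description asks for. Real recordings with noise or cross-talk between channels would need something more robust than a fixed energy threshold.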
Alternatives and similar repositories for SpeakerDiarization_RNN_CNN_LSTM
Users that are interested in SpeakerDiarization_RNN_CNN_LSTM are comparing it to the libraries listed below
- Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition.☆36Updated 6 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆96Updated 5 years ago
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆103Updated 6 years ago
- Text Independent Speaker Verification Using GE2E Loss☆84Updated 7 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆96Updated 2 years ago
- ☆60Updated 5 years ago
- Speech Enhancement using Bayesian WaveNet☆98Updated 7 years ago
- Voxceleb1 i-vector based speaker recognition system☆44Updated 7 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆79Updated 3 years ago
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem☆51Updated 7 years ago
- Share some recent speaker recognition papers and their implementations.☆90Updated 6 years ago
- A lightweight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Updated 5 years ago
- An unofficial PyTorch implementation of Google's VoiceFilter☆103Updated 2 years ago
- Paper: https://arxiv.org/abs/1702.02285☆64Updated 7 years ago
- Denoise Speech (Enhanced Speech or Speech enhancement) by Deep Learning (Using Keras and Tensorflow)☆39Updated 7 years ago
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper☆141Updated 2 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆111Updated last year
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆62Updated 5 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆40Updated 7 years ago
- Implementation of voice conversion system utilizing phonetic posteriorgrams (status: archive)☆81Updated 5 years ago
- A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.☆67Updated 6 years ago
- ☆35Updated 6 years ago
- Constrained Permutation Invariant Training, Speech Separation☆52Updated 4 years ago
- RASTA-PLP and MFCC tool based on rasta-mat☆33Updated 3 years ago
- fast SpecAugmentation code with numpy and scipy☆31Updated 6 years ago
- Develop speaker recognition model based on i-vector using TIMIT database☆16Updated 6 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆46Updated 5 years ago
- Robust Speech Recognition Using Generative Adversarial Networks (GAN)☆59Updated 6 years ago
- Speaker diarization with GMM-UBM and MAP Adaptation☆31Updated 7 years ago
- py-webrtcvad wrapper for trimming speech clips☆48Updated 3 years ago