vishalshar / SpeakerDiarization_RNN_CNN_LSTMLinks
Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should state when speaker starts and ends. In this project, we analyze given audio file with 2 channels and 2 speakers (on separate channels).
☆64Updated 4 years ago
Alternatives and similar repositories for SpeakerDiarization_RNN_CNN_LSTM
Users that are interested in SpeakerDiarization_RNN_CNN_LSTM are comparing it to the libraries listed below
Sorting:
- ☆60Updated 4 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆95Updated 2 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆95Updated 5 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Updated 5 years ago
- Text Independent Speaker Verification Using GE2E Loss☆84Updated 6 years ago
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆103Updated 6 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆78Updated 3 years ago
- A unofficial Pytorch implementation of Google's VoiceFilter☆100Updated 2 years ago
- Share some recent speaker recognition papers and their implementations.☆90Updated 5 years ago
- This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1☆112Updated 6 years ago
- Paper: https://arxiv.org/abs/1702.02285☆64Updated 6 years ago
- Speech Enhancement using Bayesian WaveNet☆96Updated 7 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆109Updated last year
- This Repository includes four different implementations of the Speaker Verification task including the GMM_UBM, Ivector, Deep-Speaker, an…☆32Updated 7 years ago
- Robust Speech Recognition Using Generative Adversarial Networks (GAN)☆59Updated 5 years ago
- RASTA-PLP and MFCC tool based rasta-mat☆33Updated 3 years ago
- Voice Activity Detection LSTM-RNN learning model☆50Updated 7 years ago
- Deep Discriminative Embeddings for Duration Robust Speaker Verification☆19Updated 5 years ago
- implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆36Updated 5 years ago
- Implementing speaker recognition using Python (GMM-UBM)☆29Updated 7 years ago
- Tensorflow implementation of "Speaker-independent Speech Separation with Deep Attractor Network"☆90Updated 4 years ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆62Updated 5 years ago
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper☆142Updated last year
- Bidirectional dynamic RNN + CTC for phoneme recognition☆46Updated 5 years ago
- A Convolutional Neural Network based Voice Activity Detector for Smartphones☆71Updated 6 years ago
- LogMMSE speech enhancement/noise reduction☆88Updated 5 years ago
- Constrained Permutation Invariant Training, Speech Separation☆47Updated 4 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆60Updated 4 years ago
- Speech enhancement system for the CHiME-5 dinner party scenario☆109Updated 5 months ago
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem☆51Updated 7 years ago