vishalshar / SpeakerDiarization_RNN_CNN_LSTM
Speaker diarization is the problem of separating the speakers in an audio recording. There can be any number of speakers, and the final result should state when each speaker starts and stops talking. In this project, we analyze a given audio file with 2 channels and 2 speakers (on separate channels).
☆64Updated 4 years ago
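As a rough illustration of the two-channel setting described above, the sketch below marks when each speaker is active by thresholding the frame energy of each channel. This is a simple baseline chosen for illustration, not the neural-network approach the repository name suggests; the function names, the 400-sample frame length, and the 0.01 energy threshold are all assumptions.

```python
from typing import Dict, List, Tuple

Segment = Tuple[float, float]  # (start_seconds, end_seconds)


def active_segments(samples: List[float], sample_rate: int,
                    frame_len: int = 400, threshold: float = 0.01) -> List[Segment]:
    """Return spans where the mean energy of a frame exceeds the threshold.

    frame_len and threshold are illustrative defaults, not tuned values.
    Trailing samples that do not fill a whole frame are ignored.
    """
    segments: List[Segment] = []
    start = None  # index of the frame where the current active run began
    n_frames = len(samples) // frame_len
    for i in range(n_frames):
        frame = samples[i * frame_len:(i + 1) * frame_len]
        energy = sum(x * x for x in frame) / frame_len
        if energy > threshold and start is None:
            start = i  # speech begins
        elif energy <= threshold and start is not None:
            # speech ends at the beginning of this quiet frame
            segments.append((start * frame_len / sample_rate,
                             i * frame_len / sample_rate))
            start = None
    if start is not None:
        # the speaker was still active when the audio ended
        segments.append((start * frame_len / sample_rate,
                         n_frames * frame_len / sample_rate))
    return segments


def diarize_two_channel(left: List[float], right: List[float],
                        sample_rate: int) -> Dict[str, List[Segment]]:
    """Treat each channel as one speaker and report that speaker's active spans."""
    return {
        "speaker_1": active_segments(left, sample_rate),
        "speaker_2": active_segments(right, sample_rate),
    }
```

Calling `diarize_two_channel(left, right, 16000)` with two equal-length lists of float samples returns one list of `(start, end)` spans per speaker. Real recordings have noise and cross-talk between channels, which is where a learned model is likely to do better than a fixed threshold.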
Alternatives and similar repositories for SpeakerDiarization_RNN_CNN_LSTM
Users that are interested in SpeakerDiarization_RNN_CNN_LSTM are comparing it to the libraries listed below
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆95Updated 2 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition.☆36Updated 6 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆95Updated 5 years ago
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆103Updated 6 years ago
- ☆60Updated 5 years ago
- Speech Enhancement using Bayesian WaveNet☆97Updated 7 years ago
- Text Independent Speaker Verification Using GE2E Loss☆84Updated 6 years ago
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper☆142Updated 2 years ago
- Speaker diarization python system based on binary key speaker modelling☆60Updated 3 years ago
- Share some recent speaker recognition papers and their implementations.☆90Updated 6 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆79Updated 3 years ago
- Voice Activity Detection LSTM-RNN learning model☆50Updated 7 years ago
- An unofficial PyTorch implementation of Google's VoiceFilter☆102Updated 2 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆110Updated last year
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem☆51Updated 7 years ago
- Lightweight neural speaker embedding extraction based on Kaldi and PyTorch.☆136Updated 5 years ago
- A PyTorch implementation of SEGAN based on INTERSPEECH 2017 paper "SEGAN: Speech Enhancement Generative Adversarial Network"☆152Updated 5 years ago
- LogMMSE speech enhancement/noise reduction☆89Updated 5 years ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆62Updated 5 years ago
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…☆28Updated 5 years ago
- An open-source speech separation and enhancement library☆213Updated 5 years ago
- Voxceleb1 i-vector based speaker recognition system☆43Updated 7 years ago
- This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1☆113Updated 6 years ago
- Robust Speech Recognition Using Generative Adversarial Networks (GAN)☆59Updated 5 years ago
- ☆38Updated 5 years ago
- Paper: https://arxiv.org/abs/1702.02285☆64Updated 6 years ago
- fast SpecAugmentation code with numpy and scipy☆31Updated 6 years ago
- ☆35Updated 6 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆46Updated 5 years ago
- Includes some core functions and models for speech separation☆155Updated 4 years ago