shvmshukla / Speaker-Change-Detection
Speaker Diarization is the first step in many early audio processing and aims to solve the problem ”who spoke when”. It therefore relies on efficient use of temporal information from extracted audio features.
☆11Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for Speaker-Change-Detection
- The implementation of 'Watch, Listen, Attend and Spell’ (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Updated 6 years ago
- Anonymous ICLR Submission☆14Updated 5 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆44Updated 3 years ago
- ☆56Updated 6 years ago
- Cochlear.ai submission for dcase2018 task2☆17Updated 6 years ago
- These are the results for VoiceGAN voice transformation. You can hear the audios which are in folder A-AB-ABA/B-BA-BAB☆50Updated 5 years ago
- Implementation of WaveNet with Gluon☆16Updated 5 years ago
- PyTorch implementations of neural network models for keyword spotting☆11Updated 4 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 3 years ago
- A Text2Speech Engine built in Pytorch.☆11Updated 5 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆24Updated 4 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Updated 5 years ago
- Tensorflow Implementation of WaveGlow☆37Updated 4 years ago
- ☆27Updated 5 years ago
- ☆31Updated 6 years ago
- Voice Conversion using Tacotron.☆11Updated last year
- ☆19Updated 6 years ago
- Code for ICASSP 2019 paper☆18Updated 6 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Updated 5 years ago
- Comprehensive Python library for speech and voice.☆33Updated last year
- ESPnet-TTS Audio Sample HP☆21Updated 5 years ago
- wavenet vocoder using tensorflow☆27Updated 6 years ago
- Recognizing common speech commands using Keras and Tensorflow.☆11Updated 5 years ago
- Pytorch Code for S2IGAN☆41Updated 4 years ago
- Random regression forests for audio event detection☆9Updated 7 years ago
- A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.☆15Updated 4 years ago
- Masked ConditionaL Neural Networks☆15Updated last year
- Speech command recognition DenseNet transfer learning from UrbanSound8k in keras tensorflow☆17Updated 6 years ago
- Surrey CVSSP DCASE 2018 Task 2 system☆19Updated last year