shvmshukla / Speaker-Change-Detection
Speaker Diarization is the first step in many early audio processing and aims to solve the problem ”who spoke when”. It therefore relies on efficient use of temporal information from extracted audio features.
☆12Updated 6 years ago
Alternatives and similar repositories for Speaker-Change-Detection:
Users that are interested in Speaker-Change-Detection are comparing it to the libraries listed below
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Updated 5 years ago
- A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.☆15Updated 4 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Updated 5 years ago
- Mispronunciation detection code for jingju singing voice☆20Updated 6 years ago
- The implementation of 'Watch, Listen, Attend and Spell’ (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Updated 7 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆47Updated 8 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated last year
- ☆31Updated 6 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆43Updated last year
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago
- Voice Conversion using Tacotron.☆11Updated 2 years ago
- An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!!☆86Updated 4 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 3 years ago
- Source code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challeng…☆58Updated 6 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆65Updated 4 years ago
- Emotion recognition of Speaker's Speech Data. Employ speaker detection classifiers for emotion recognition, a multiclass classification p…☆16Updated 9 years ago
- Best Collection of Articles and code for Audio Classification☆15Updated 5 years ago
- ☆56Updated 6 years ago
- TensorFlow Speech Recognition Challenge (Top 15%)☆14Updated 7 years ago
- Anonymous ICLR Submission☆14Updated 5 years ago
- Pytorch Code for S2IGAN☆41Updated 4 years ago
- Example implementation of Monotonic Chunkwise Attention.☆51Updated 7 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆44Updated 3 years ago
- SpeechYOLO Interspeech 2019☆43Updated 2 years ago
- Tensorflow Implementation of WaveGlow☆37Updated 4 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆25Updated 4 years ago
- Live demo for speech emotion recognition using Keras and Tensorflow models☆39Updated 7 months ago
- implement end-to-end asr algorithm with tensorflow☆40Updated 6 years ago
- Curriculum Vitae of Quan Wang☆15Updated 2 months ago
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?☆34Updated 6 years ago