shvmshukla / Speaker-Change-Detection
Speaker Diarization is the first step in many early audio processing and aims to solve the problem ”who spoke when”. It therefore relies on efficient use of temporal information from extracted audio features.
☆11Updated 6 years ago
Alternatives and similar repositories for Speaker-Change-Detection:
Users that are interested in Speaker-Change-Detection are comparing it to the libraries listed below
- The implementation of 'Watch, Listen, Attend and Spell’ (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Updated 6 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆47Updated 7 years ago
- Anonymous ICLR Submission☆14Updated 5 years ago
- A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.☆15Updated 4 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Updated 5 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 3 years ago
- WaveRNN-based waveform generator & demo of TensorFlow CuDNN-GRU usage.☆23Updated 6 years ago
- Recognizing common speech commands using Keras and Tensorflow.☆11Updated 6 years ago
- ☆31Updated 6 years ago
- TensorFlow Speech Recognition Challenge (Top 15%)☆14Updated 7 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆65Updated 4 years ago
- Emotion recognition of Speaker's Speech Data. Employ speaker detection classifiers for emotion recognition, a multiclass classification p…☆16Updated 9 years ago
- Tensorflow Implementation of WaveGlow☆37Updated 4 years ago
- ☆56Updated 6 years ago
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?☆34Updated 6 years ago
- An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery☆26Updated 5 years ago
- ☆27Updated 5 years ago
- Comprehensive Python library for speech and voice.☆33Updated 2 years ago
- Run speaker recognition algorithms - Mirrored from https://gitlab.idiap.ch/bob/bob.bio.spear☆19Updated last year
- Random regression forests for audio event detection☆9Updated 8 years ago
- Voice Conversion using Tacotron.☆11Updated 2 years ago
- List of papers about TTS / Список статей о TTS☆10Updated 7 years ago
- ESPnet-TTS Audio Sample HP☆21Updated 5 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆44Updated 3 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Updated 5 years ago
- Implementation of Differential Learning Rate in Keras☆11Updated 5 years ago
- ☆29Updated 4 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆24Updated 4 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆43Updated last year
- Inspired work by the project of SER using ELM at Microsoft Research☆19Updated 6 years ago