shvmshukla / Speaker-Change-Detection
Speaker diarization is the first step in many audio processing pipelines and aims to solve the problem of "who spoke when". It therefore relies on efficient use of temporal information from extracted audio features.
☆12Updated 6 years ago
Alternatives and similar repositories for Speaker-Change-Detection
Users who are interested in Speaker-Change-Detection are comparing it to the libraries listed below
- Mispronunciation detection code for jingju singing voice☆20Updated 6 years ago
- Predicting emotions based on speech audio samples of American English, German and British English languages using Support Vector Machine,…☆20Updated 7 years ago
- Automatic Speech Recognition Dataset Generation☆37Updated 6 years ago
- The implementation of 'Watch, Listen, Attend and Spell' (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Updated 7 years ago
- Anonymous ICLR Submission☆14Updated 5 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆48Updated 8 years ago
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?☆33Updated 7 years ago
- Curriculum Vitae of Quan Wang☆15Updated this week
- A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.☆15Updated 4 years ago
- Comprehensive Python library for speech and voice.☆32Updated 2 years ago
- A TensorFlow implementation along the lines of "Neural Speech Synthesis with Transformer Network", ported from OpenSeq2Seq☆20Updated last year
- ☆19Updated 7 years ago
- List of papers about TTS☆10Updated 7 years ago
- Pytorch Code for S2IGAN☆41Updated 4 years ago
- An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!!☆85Updated 4 years ago
- ☆31Updated 6 years ago
- An end-to-end Python pipeline for performing sentiment analysis on audio files of call-center conversations.☆36Updated 7 years ago
- 2nd place solution for ID R&D Voice Antispoofing Challenge☆15Updated 5 years ago
- Follows NVIDIA, simplified and with data-parallel support.☆13Updated 5 years ago
- SpeechYOLO Interspeech 2019☆43Updated 2 years ago
- ESPnet-TTS Audio Sample HP☆21Updated 5 years ago
- Compressed version of Tacotron 2 using Tensor Train + Waveglow.☆22Updated 5 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition.☆36Updated 5 years ago
- ☆27Updated 6 years ago
- The repository contains all the code necessary for my project - Automatic Speech Recognition System in Hindi Language (Project descript…☆28Updated 5 years ago
- Tensorflow Implementation of WaveGlow☆37Updated 5 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Updated 5 years ago
- Pytorch Hackathon☆7Updated 3 months ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆25Updated 5 years ago
- Text Independent Speaker Verification Using GE2E Loss☆84Updated 6 years ago