shvmshukla / Speaker-Change-Detection
Speaker diarization is the first step in many audio processing pipelines and aims to solve the problem of "who spoke when". It therefore relies on efficient use of temporal information from extracted audio features.
☆12Updated 6 years ago
Alternatives and similar repositories for Speaker-Change-Detection
Users who are interested in Speaker-Change-Detection are comparing it to the libraries listed below.
- Anonymous ICLR Submission☆14Updated 5 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆49Updated 8 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tensorflow for speaker recognition.☆36Updated 5 years ago
- A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.☆15Updated 5 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 4 years ago
- ☆31Updated 6 years ago
- Tensorflow Implementation of WaveGlow☆37Updated 5 years ago
- ☆56Updated 6 years ago
- Comprehensive Python library for speech and voice.☆32Updated 2 years ago
- The repository contains all the code necessary for my project - Automatic Speech Recognition System in Hindi Language (Project descript…☆28Updated 5 years ago
- Losses and decoders for end-to-end ASR and OCR☆34Updated 4 years ago
- An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!!☆85Updated 4 years ago
- The implementation of 'Watch, Listen, Attend and Spell' (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Updated 7 years ago
- A Tensorflow Implementation like "Neural Speech Synthesis with Transformer Network" Port From OpenSeq2Seq☆20Updated 2 years ago
- List of papers about TTS☆10Updated 7 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated last year
- Sound augmentation using Large-scale audio dataset (Audioset)☆45Updated 4 years ago
- Example implementation of Monotonic Chunkwise Attention.☆52Updated 7 years ago
- Recognizing common speech commands using Keras and Tensorflow.☆11Updated 6 years ago
- A neural network for filtering target speaker's voice from audio written in tensorflow☆21Updated 7 years ago
- Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)☆52Updated 6 years ago
- ABX and kaldi experiments on speech corpora made easy☆33Updated 10 months ago
- Mispronunciation detection code for jingju singing voice☆20Updated 6 years ago
- Basic wavenet and fftnet vocoder model.☆19Updated 3 years ago
- ☆38Updated 5 years ago
- readers that enable reading kaldi ark in tensorflow☆17Updated 7 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆27Updated 4 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Updated 5 years ago
- Automatic Speech Recognition Dataset Generation☆37Updated 6 years ago