amirharati / kaldi-alligner
scripts to align a given wave to its transcription using trained models by Kaldi
☆32Updated 5 years ago
Alternatives and similar repositories for kaldi-alligner:
Users that are interested in kaldi-alligner are comparing it to the libraries listed below
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated 2 years ago
- streaming attention networks for end-to-end automatic speech recognition☆55Updated 4 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆73Updated 4 years ago
- multilingual speech aligner☆73Updated last year
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆63Updated last year
- A pytorch wrapper for LF-MMI training and parallel training in Kaldi☆73Updated 2 years ago
- Neural network-based forced alignment with bidirectional attention mechanism☆74Updated 2 months ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆36Updated 5 years ago
- ☆40Updated 3 years ago
- a standalone pitch extractor☆13Updated 7 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated 2 years ago
- ☆25Updated 2 years ago
- create CMakeLists.txt for kaldi☆20Updated 4 years ago
- ☆76Updated 3 years ago
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.☆44Updated 5 years ago
- ☆51Updated 6 years ago
- Tacotron2 with Global Style Tokens☆63Updated 5 years ago
- Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet☆61Updated 3 years ago
- Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021☆39Updated 3 years ago
- ☆35Updated last month
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆59Updated 3 years ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆34Updated 2 years ago
- An online speech recognition extension toolkit of Kaldi☆56Updated 3 years ago
- CMU multilingual speech repository☆31Updated 2 years ago
- The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"☆38Updated 4 years ago
- it's ASR decoder and make graph project☆32Updated 2 years ago
- Implementation of the AlignTTS☆76Updated last year
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Updated 5 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆42Updated 4 years ago