sarulab-speech / jtubespeechLinks
☆222Updated last year
Alternatives and similar repositories for jtubespeech
Users that are interested in jtubespeech are comparing it to the libraries listed below
Sorting:
- ☆87Updated 4 years ago
- context labels and pronunciation data for JSUT corpus☆70Updated 3 years ago
- xvector model on jtubespeech☆45Updated last year
- ESPnet Model Zoo☆252Updated last year
- End-to-End Neural Diarization☆402Updated 3 years ago
- Segment an audio file and obtain utterance alignments. (Python package)☆337Updated last year
- HTS-style full-context labels for JSUT v1.1☆47Updated 4 years ago
- Onnx wrapper for espnet infrernce model☆163Updated 8 months ago
- An advance kaldi wrapper for Pyhton☆38Updated 4 years ago
- ☆32Updated 2 years ago
- UT-Sarulab MOS prediction system using SSL models☆243Updated last year
- A pure python module for reading and writing kaldi ark files☆259Updated 3 months ago
- Charsiu: A neural phonetic aligner.☆305Updated 2 years ago
- ☆49Updated 5 months ago
- see README☆349Updated 11 months ago
- A torch implementation of a recursion which turns out to be useful for RNN-T.☆141Updated last year
- Official implementation of the source-filter HiFiGAN vocoder☆254Updated last year
- Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context☆195Updated 9 months ago
- A curated list of awesome papers on contextualizing E2E ASR outputs☆77Updated 2 years ago
- Multilingual G2P in 100 languages☆331Updated 2 years ago
- Easy-to-Use Speech MOS predictors☆290Updated last year
- Data and code for grapheme-to-phoneme transducers in lots of languages☆137Updated last year
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"☆369Updated 11 months ago
- ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)☆259Updated 2 years ago
- INTERSPEECH 2019 Tutorial Materials☆193Updated 4 years ago
- Unofficial implementation of miipher☆129Updated last year
- Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053☆145Updated 3 years ago
- ☆67Updated 2 weeks ago
- Towards hot directions in industrial end to end speech recognition☆326Updated 3 years ago
- ☆273Updated 4 years ago