kosuke-kitahara / xlsr-wav2vec2-phoneme-recognition
☆29 Updated 3 years ago
Alternatives and similar repositories for xlsr-wav2vec2-phoneme-recognition:
Users who are interested in xlsr-wav2vec2-phoneme-recognition are comparing it to the repositories listed below.
- [Interspeech22] Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass… ☆24 Updated last year
- multilingual speech aligner ☆72 Updated last year
- Phoneme segmentation using pre-trained speech models ☆55 Updated 2 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver… ☆89 Updated 4 years ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se… ☆84 Updated 2 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y… ☆25 Updated 5 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021 ☆57 Updated 2 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts ☆60 Updated 2 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020) ☆79 Updated 3 years ago
- ☆97 Updated 3 years ago
- asr2k ☆49 Updated 9 months ago
- demo page https://MingjieChen.github.io/dygan-vc ☆67 Updated 2 years ago
- ☆91 Updated 3 years ago
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models" ☆45 Updated 2 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit. ☆50 Updated 2 years ago
- An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss". ☆33 Updated 3 years ago
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data ☆70 Updated 3 years ago
- ☆40 Updated 3 years ago
- Official Implementation of Mockingjay in Pytorch ☆53 Updated last year
- End-to-End Mispronunciation Detection via wav2vec2.0 ☆43 Updated 3 years ago
- ☆53 Updated 4 years ago
- Alignment files of LibriTTS. ☆61 Updated 4 years ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN) ☆60 Updated 4 years ago
- ☆66 Updated 2 months ago
- Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech ☆96 Updated 2 years ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper ☆22 Updated 2 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques ☆58 Updated 3 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model. ☆73 Updated 4 years ago
- Clustering-based methods for overlapping diarization ☆76 Updated last year
- Keyword spotting and forced alignment in any language ☆52 Updated 8 months ago