kosuke-kitahara / xlsr-wav2vec2-phoneme-recognition
☆27Updated 3 years ago
Related projects:
- [Interspeech22] Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆19Updated 7 months ago
- asr2k☆48Updated 3 months ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆24Updated 5 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆45Updated last year
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆84Updated 3 years ago
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models"☆44Updated 2 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆78Updated 2 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆57Updated last year
- Phoneme segmentation using pre-trained speech models☆49Updated last year
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆47Updated last year
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆78Updated last year
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆17Updated 2 years ago
- ☆57Updated 2 weeks ago
- multilingual speech aligner☆70Updated 10 months ago
- Demo page: https://MingjieChen.github.io/dygan-vc ☆67Updated 2 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆27Updated 4 months ago
- Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021☆39Updated 3 years ago
- ☆56Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆95Updated last year
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64Updated last year
- Toolbox for easy and qualitative one-shot voice conversion☆44Updated 2 years ago
- ☆28Updated 4 years ago
- Interface for Controllable Expressive Talking Machine☆37Updated 8 months ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆86Updated 2 years ago
- ☆40Updated 2 years ago
- Demo for 2022 ICASSP☆64Updated 2 years ago
- Official PyTorch implementation of TTS Style Transfer☆24Updated 2 years ago
- ☆96Updated 3 years ago
- ☆26Updated 2 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆39Updated 2 months ago