chutaklee / CantoASR
Fine-tuning Wav2Vec2.0 on Common Voice (zh-HK)
☆14 · Updated 2 years ago
Related projects:
- Zero-Shot Foreign Accent Conversion without a Native Reference ☆27 · Updated 4 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS ☆64 · Updated last year
- Finetuning VITS Efficiently ☆31 · Updated 10 months ago
- 56 languages, 1 model: Multilingual ASR ☆23 · Updated 3 years ago
- Multilingual speech aligner ☆70 · Updated 10 months ago
- One script for xls-r/xlsr/whisper fine-tuning ☆37 · Updated last year
- An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss". ☆33 · Updated 3 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp… ☆45 · Updated last year
- PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis ☆66 · Updated 3 years ago
- asr2k ☆48 · Updated 3 months ago
- Convert English text from written expressions into spoken forms ☆19 · Updated 2 years ago
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data ☆69 · Updated 3 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration ☆34 · Updated 2 years ago
- E2E TTS using Conditional Flow Matching (Experimental*) ☆65 · Updated 10 months ago
- Prosodic Speech Segmentation with Transformers ☆22 · Updated 6 months ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis ☆29 · Updated last month
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth… ☆81 · Updated last year
- Unofficial PyTorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake… ☆56 · Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆74 · Updated 2 months ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert ☆39 · Updated last year
- An audio and transcribed corpus of contemporary Hong Kong Cantonese ☆34 · Updated 3 years ago
- Source code for the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models" ☆44 · Updated 2 years ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc… ☆70 · Updated last year