chutaklee / CantoASR
Fine-tuning Wav2Vec2.0 on Common Voice(zh-HK)
☆14Updated 2 years ago
Alternatives and similar repositories for CantoASR:
Users that are interested in CantoASR are comparing it to the libraries listed below
- ☆56Updated 9 months ago
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆39Updated last year
- 56 language, 1 model Multilingual ASR☆25Updated 3 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated last year
- one script for xls-r/xlsr/whisper fine-tuning☆41Updated last year
- Finetuning VITS Efficiently☆32Updated last year
- ☆33Updated last year
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- Zero-Shot Foreign Accent Conversion without a Native Reference☆31Updated 11 months ago
- Putting flows on top of neural transducers for better TTS☆62Updated 3 weeks ago
- multilingual speech aligner☆72Updated last year
- Convert English text from written expressions into spoken forms☆24Updated 2 years ago
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆74Updated last year
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.☆36Updated 2 years ago
- ☆42Updated 2 years ago
- Official Code for ParrotTTS☆48Updated 5 months ago
- cantonese-mandarin unsupervised neural translation for sw project☆26Updated last year
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Updated 3 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆48Updated 9 months ago
- PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis☆69Updated 3 years ago
- ☆38Updated 3 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆95Updated 5 months ago
- ☆34Updated 3 years ago
- ☆56Updated 2 years ago
- Phoneme segmentation using pre-trained speech models☆55Updated 2 years ago
- ☆12Updated last year
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆84Updated 2 years ago
- Toolbox for easy and qualitative one-shot voice conversion☆45Updated 3 years ago
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- ☆112Updated 2 years ago