chutaklee / CantoASR
Fine-tuning Wav2Vec2.0 on Common Voice(zh-HK)
☆14Updated 2 years ago
Alternatives and similar repositories for CantoASR:
Users that are interested in CantoASR are comparing it to the libraries listed below
- demo page https://MingjieChen.github.io/dygan-vc☆67Updated 2 years ago
- ☆56Updated 2 years ago
- multilingual speech aligner☆73Updated last year
- Neural network-based forced alignment with bidirectional attention mechanism☆71Updated this week
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆74Updated last year
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64Updated last year
- ☆33Updated last year
- Phoneme segmentation using pre-trained speech models☆54Updated 2 years ago
- Finetuning VITS Efficiently☆32Updated last year
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆120Updated 2 years ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆82Updated 2 years ago
- ☆37Updated 3 years ago
- 56 language, 1 model Multilingual ASR☆24Updated 3 years ago
- PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis☆67Updated 3 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆50Updated 2 years ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆113Updated 2 years ago
- ☆52Updated 6 months ago
- ☆111Updated 2 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆57Updated 3 years ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network☆45Updated 3 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆76Updated last year
- Toolbox for easy and qualitative one-shot voice conversion☆45Updated 3 years ago
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data☆70Updated 3 years ago
- Huawei Grad-TTS for Chinese☆45Updated last year
- Putting flows on top of neural transducers for better TTS☆62Updated 3 weeks ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.☆88Updated 2 years ago
- TransferTTS (Zero-Shot learning of VITS)☆94Updated 2 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆46Updated 6 months ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆130Updated last year
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆87Updated 2 years ago