cpii-cai / PunCantonese
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆15 · Updated 4 months ago
Alternatives and similar repositories for PunCantonese:
Users who are interested in PunCantonese are comparing it to the repositories listed below.
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks ☆17 · Updated last year
- This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for… ☆9 · Updated 3 years ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024) ☆12 · Updated 7 months ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition. ☆16 · Updated 3 weeks ago
- A library of speech gadgets. ☆13 · Updated 2 years ago
- ☆13 · Updated 8 months ago
- This repository contains all the code necessary for running multilingual DistilWhisper from the Ferraz et al. IEEE ICASSP 2024 paper. ☆21 · Updated last year
- Phoneme and duration labeling based on Whisper small ☆11 · Updated 9 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment ☆16 · Updated 3 years ago
- ☆19 · Updated last year
- ☆13 · Updated 3 years ago
- ☆11 · Updated 2 months ago
- Speech Resynthesis and Language Modeling Using Flow Matching and Llama ☆17 · Updated this week
- ☆11 · Updated 2 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning ☆17 · Updated 6 months ago
- text to speech ☆10 · Updated last year
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless, and TortoiseTTS into one ☆27 · Updated 8 months ago
- Paper, Code and Statistics for Speech Generation. ☆10 · Updated 2 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge ☆21 · Updated 2 years ago
- A handy dataset of noises for ASR ☆21 · Updated 5 years ago
- ☆12 · Updated 2 months ago
- Torchaudio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English. ☆11 · Updated 4 months ago
- Forced alignment decoder for Whisper. ☆14 · Updated last year
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch ☆24 · Updated 4 years ago
- End-to-end speech synthesis system with FastSpeech2 and HiFi-GAN ☆13 · Updated 2 years ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022) ☆19 · Updated 2 years ago
- Enable RNNLM lattice rescoring with PyTorch [Kaldi] ☆12 · Updated 4 years ago
- Production-ready vocoder using BigVSAN ☆11 · Updated last year
- Chinese Mandarin Synthesis Corpus-Female/Emotional ☆10 · Updated 8 months ago
- SpeechNAS: Better Trade-off between Latency and Accuracy for Large-Scale Speaker Verification ☆30 · Updated 2 years ago