v-nhandt21 / Viphoneme
Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA
☆67Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for Viphoneme
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…☆49Updated last year
- XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)☆304Updated 3 months ago
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)☆20Updated 3 months ago
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆93Updated 3 years ago
- Transformation spoken text to written text☆28Updated 5 months ago
- A Vietnamese phonetizer☆48Updated 5 months ago
- ☆41Updated 2 months ago
- Finetune Wa2vec 2.0 For Speech Recognition☆111Updated last year
- Solution for Zalo AI Challenge 2022 - Lyrics Alignment☆67Updated last year
- Vietnamese Punctuation Prediction using Pretrained Language Models☆13Updated 2 years ago
- Vietnamese Voice Cloning System using Speaker Verification training on multispeaker VITS☆39Updated 11 months ago
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆59Updated 3 months ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆33Updated last year
- A synthesized dataset for Vietnamese TTS task☆58Updated 2 years ago
- ☆16Updated 2 years ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆112Updated 2 years ago
- Official repo for the Vietnam-Celeb dataset☆19Updated last year
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…☆79Updated last year
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.☆35Updated 2 years ago
- This repository provides some useful snippets that you may need in some situations.☆10Updated 9 months ago
- FlashSpeech: Efficient Zero-Shot Speech Synthesis☆93Updated last month
- ☆50Updated 9 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64Updated last year
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆76Updated last year
- ☆53Updated 4 months ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆58Updated 2 years ago
- It's a repository for implementations of neural speech editing algorithms.☆191Updated 10 months ago
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆121Updated 8 months ago
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.☆44Updated 4 years ago
- PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation☆190Updated 2 years ago