Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA
☆103Jun 21, 2024Updated last year
Alternatives and similar repositories for Viphoneme
Users that are interested in Viphoneme are comparing it to the libraries listed below
Sorting:
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…☆66Jan 1, 2025Updated last year
- Vietnamese Voice Cloning System using Speaker Verification training on multispeaker VITS☆57Dec 1, 2023Updated 2 years ago
- A Vietnamese phonetizer☆53May 29, 2024Updated last year
- Vietnamese Text to Speech library☆253Aug 20, 2023Updated 2 years ago
- Fine-tuning Vietnamese Text-to-speech model (VITS)☆55Mar 18, 2025Updated 11 months ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions☆268Jan 13, 2025Updated last year
- phone inventory library☆17May 15, 2023Updated 2 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆71Dec 2, 2022Updated 3 years ago
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆77Jul 16, 2023Updated 2 years ago
- StyleTTS 2 Optimized Training Fork☆33Feb 2, 2025Updated last year
- Text-to-Speech Latency Benchmark☆22Jan 16, 2026Updated last month
- Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn, với 12G GPU (RTX 3060) và khoảng vài chục MB dữ liệu☆114Jun 10, 2023Updated 2 years ago
- The grapheme to phoneme model converts Kazakh(Arab|Cyrillic) characters to phonemes.☆12Sep 30, 2019Updated 6 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Dec 1, 2022Updated 3 years ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆52May 22, 2025Updated 9 months ago
- XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)☆347Jul 22, 2024Updated last year
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-…☆16Feb 1, 2026Updated last month
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)☆22Jun 5, 2025Updated 9 months ago
- Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"☆68Dec 13, 2021Updated 4 years ago
- VietASR - Vietnamese Automatic Speech Recognition☆164Oct 29, 2024Updated last year
- ViSen is library to format tone of Vietnamese sentences☆20Nov 9, 2021Updated 4 years ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆40Jul 10, 2023Updated 2 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languages☆147Apr 5, 2024Updated last year
- StyleTTS2 + Vocos as a Decoder☆13Mar 24, 2025Updated 11 months ago
- Sequence algorithms for use in Flashlight.☆14Jan 12, 2026Updated last month
- Sing any popular song with your voice☆11Jul 10, 2022Updated 3 years ago
- visual-text to speech☆14Apr 3, 2022Updated 3 years ago
- python script to download & process data to train a speech-synthesis model of Vietnamese M.C. Nguyễn Ngọc Ngạn☆14Aug 13, 2024Updated last year
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆105Sep 3, 2021Updated 4 years ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆125Jun 16, 2022Updated 3 years ago
- High quality text-to-speech based on StyleTTS 2.☆73Feb 25, 2026Updated last week
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- 22人で童謡を5曲ずつ歌ってつくった歌唱データベースです。☆14Aug 7, 2022Updated 3 years ago
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …☆16Sep 5, 2017Updated 8 years ago
- Forced alignment decoder for Whisper.☆14Mar 13, 2024Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Jun 7, 2025Updated 8 months ago