Pytorch implementation of CS-Tacotron, a code-switching speech synthesis end-to-end generative TTS model.
☆23Mar 14, 2019Updated 7 years ago
Alternatives and similar repositories for CS-Tacotron-Pytorch
Users that are interested in CS-Tacotron-Pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pytorch implementation of Tacotron, a speech synthesis end-to-end generative TTS model.☆29Mar 14, 2019Updated 7 years ago
- tacotron for research on Chinese speech synthesis and Taiwanese speech synthesis from Chinese input text sequence with different granular…☆25Aug 2, 2018Updated 7 years ago
- TPSE-GST Tacotron2☆14May 1, 2019Updated 6 years ago
- TTS前,文本标准化,将数字字母处理转化为汉字☆12Apr 27, 2024Updated last year
- chinese_tacotron-2☆12Feb 27, 2018Updated 8 years ago
- ☆15May 8, 2021Updated 4 years ago
- A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis☆115Dec 2, 2020Updated 5 years ago
- Code for paper titled "Using generative modelling to produce varied intonation for speech synthesis" submitted to the Speech Synthesis Wo…☆24Dec 8, 2019Updated 6 years ago
- SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech☆27May 25, 2023Updated 2 years ago
- An implementation of "Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language" …☆114Jun 19, 2020Updated 5 years ago
- TTS model based on Transformer.☆58Aug 2, 2019Updated 6 years ago
- Korean Emotional End-to-End Neural Speech synthesizer, ML4audio, NIPS2017☆72Aug 22, 2019Updated 6 years ago
- Materials accompanying the paper "Phonological features for 0-shot multilingual speech synthesis"☆34Aug 11, 2020Updated 5 years ago
- unofficial pytorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 3 years ago
- Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09…☆61Jul 23, 2018Updated 7 years ago
- ☆37May 8, 2021Updated 4 years ago
- Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization☆194Jul 12, 2024Updated last year
- Code for the paper: Deep Residual Networks with Auditory Inspired Features for Robust Speech Recognition.☆21Mar 22, 2017Updated 9 years ago
- Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automat…☆33Jun 14, 2024Updated last year
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44May 9, 2023Updated 2 years ago
- ☆22Apr 4, 2023Updated 2 years ago
- Real-time melgan based on cpu !!!☆13Dec 3, 2019Updated 6 years ago
- The code for aishell-3 baseline acoustic model☆69Nov 30, 2020Updated 5 years ago
- GPT for FACodec☆13Mar 25, 2024Updated last year
- Converts Mandarin Chinese pinyin notation to IPA (international phonetic alphabet) notation☆18Nov 28, 2023Updated 2 years ago
- 论文复现,使用pos标记进行中文多音字消歧☆21Jul 20, 2019Updated 6 years ago
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆26Feb 22, 2024Updated 2 years ago
- Phonemes and durations labeling based on whisper small☆11Jul 7, 2024Updated last year
- The offical code of "Parameter-Efficient Learning for Text-to-Speech Accent Adaptation"☆13Aug 29, 2023Updated 2 years ago
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis☆151Feb 11, 2023Updated 3 years ago
- Alignment files of LibriTTS.☆67Mar 16, 2020Updated 6 years ago
- An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery☆26Jun 24, 2019Updated 6 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.☆845Oct 10, 2023Updated 2 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- Simple ALSA Wrapper for .NET Standard☆13Oct 16, 2025Updated 5 months ago
- 拼音转汉字, convert pinyin to 汉字 using deep networks☆23Sep 18, 2020Updated 5 years ago
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago