iwater / Real-Time-Voice-Cloning-Chinese
Clone a voice in 5 seconds to generate arbitrary speech in real-time
☆34Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for Real-Time-Voice-Cloning-Chinese
- (已过时)WaveNet 声码器☆21Updated 4 years ago
- style token with tacotron2☆61Updated last year
- PyTorch reimplementation of Tacotron2 in Mandarin☆80Updated 3 years ago
- chinese tts☆74Updated 3 years ago
- tacotron-2(pytorch) + wavernn(pytorch) chinese TTS☆33Updated last year
- ☆15Updated 5 years ago
- tacotron-2(pytorch) + melgan(pytorch) chinese TTS☆26Updated last year
- speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…☆15Updated 5 years ago
- ☆33Updated 2 years ago
- Huawei Grad-TTS for Chinese☆45Updated last year
- The Implementation of FastSpeech2 Based on Pytorch.☆52Updated last year
- Chinese and English Bilinguish G2P☆20Updated last year
- Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.☆253Updated 5 years ago
- 基于 g2pW 提升 pypinyin 的准确性☆78Updated last year
- Forked from NVIDIA/tacotron2 and merged with Rayhane-mamah/Tacotron-2☆81Updated 4 years ago
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data☆69Updated 3 years ago
- (pytorch) multi speaker TTS,☆65Updated 5 years ago
- ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).☆52Updated this week
- Encoder and Decoder and Attention Based Prosody Prediction☆68Updated 6 years ago
- OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognitio…☆61Updated 2 years ago
- Python interface to the WebRTC Noise Suppression☆18Updated 2 years ago
- ☆65Updated last year
- ☆28Updated 4 years ago
- TTS model based on Transformer.☆57Updated 5 years ago
- WaveRNN Vocoder + TTS☆16Updated 4 years ago
- A Demo of Mandarin/Chinese TTS frontend☆277Updated 2 years ago
- 这个工程的目的是从视频中获取语音识别的训练数据,用于训练字幕自动生成☆53Updated 6 years ago