Azure-Samples / Cognitive-Speech-TTS
Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.
☆959Updated 3 weeks ago
Alternatives and similar repositories for Cognitive-Speech-TTS
Users who are interested in Cognitive-Speech-TTS are comparing it to the repositories listed below
- Sample code for the Microsoft Cognitive Services Speech SDK☆3,199Updated this week
- Microsoft Azure Cognitive Services Speech SDK for JavaScript☆302Updated this week
- Chinese (Mandarin) text-to-speech (TTS) based on FastSpeech 2, implemented in PyTorch, using WaveGlow as the vocoder, with biaobei …☆473Updated 3 years ago
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"☆2,033Updated last year
- Command line utility for forced alignment using Kaldi☆1,493Updated this week
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch☆1,606Updated last year
- Simple text to phones converter for multiple languages☆1,389Updated 8 months ago
- Welcome to the Microsoft Voice Assistant samples repository! Here you will find samples to help you get started building client applicati…☆117Updated last year
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone☆974Updated 7 months ago
- speech-aligner is a tool that generates phoneme-level alignment between human speech an…☆401Updated 5 years ago
- A Generative Flow for Text-to-Speech via Monotonic Alignment Search☆690Updated 2 years ago
- Allosaurus is a pretrained universal phone recognizer for more than 2000 languages☆630Updated last year
- The Implementation of FastSpeech based on pytorch.☆871Updated last year
- 🐸 collection of TTS papers☆693Updated 11 months ago
- Python interface to the WebRTC Voice Activity Detector☆2,254Updated 11 months ago
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.☆836Updated last year
- g2p: English Grapheme To Phoneme Conversion☆853Updated 2 years ago
- Examples of how to use or integrate DeepSpeech☆851Updated last year
- Live speech recognition using Facebook's wav2vec 2.0 model.☆354Updated last year
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆2,149Updated 10 months ago
- This repository has implementation for "Neural Voice Cloning With Few Samples"☆436Updated 4 years ago
- Unified-Modal Speech-Text Pre-Training for Spoken Language Processing☆1,363Updated last year
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies☆1,327Updated last year
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.☆360Updated 2 years ago
- Audio Slicer that uses silence detection to split .wav audio files into multiple .wav samples.☆301Updated last year
- VITS Chinese, TTS Chinese, TTS Mandarin: the easiest-to-train, best-sounding speech synthesis system ever☆212Updated 3 years ago
- Implementation of Meta-Voicebox: the first generative AI model for speech to generalize across tasks with state-of-the-art performance.☆581Updated last year
- ☆1,432Updated last year
- This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.☆1,224Updated 10 months ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆327Updated 6 months ago