Azure-Samples / cognitive-services-speech-sdk
Sample code for the Microsoft Cognitive Services Speech SDK
☆2,956Updated this week
Related projects ⓘ
Alternatives and complementary repositories for cognitive-services-speech-sdk
- Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.☆907Updated this week
- Chinese Mandarin tts text-to-speech 中文 (普通话) 语音 合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei …☆465Updated 2 years ago
- Python interface to the WebRTC Voice Activity Detector☆2,068Updated 4 months ago
- Offline Text To Speech synthesis for python☆2,143Updated this week
- 📣 商用级开源语音自动识别程序库,开箱即用,全平台支持,中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide …☆509Updated 6 months ago
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆4,383Updated last week
- A small speech recognizer☆3,952Updated last month
- Unified-Modal Speech-Text Pre-Training for Spoken Language Processing☆1,208Updated 6 months ago
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germa…☆3,842Updated 4 months ago
- OpenAI Whisper ASR Webservice API☆2,119Updated last month
- KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-…☆494Updated 10 months ago
- Examples of how to use or integrate DeepSpeech☆821Updated last year
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆7,029Updated this week
- Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key☆6,334Updated this week
- Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training wit…☆888Updated 4 months ago
- SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.☆445Updated 2 months ago
- Command line utility for forced alignment using Kaldi☆1,346Updated last week
- 语音api示例☆690Updated 3 months ago
- A PyTorch-based Speech Toolkit☆8,950Updated last week
- Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, …☆1,065Updated 2 months ago
- Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine. 中英语音识别、多角色语音合成,支持多语言,准确率高☆472Updated 8 months ago
- ☆1,386Updated 9 months ago
- an open-source implementation of sequence-to-sequence based speech processing engine☆952Updated last year
- 基于PaddlePaddle实现端到端中文语音识别,从入门到实战,超简单的入门案例,超实用 的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型☆817Updated this week
- Text Normalization & Inverse Text Normalization☆481Updated last week
- Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!☆1,163Updated 9 months ago
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.☆4,444Updated 7 months ago
- The official Python API for ElevenLabs Text to Speech.☆2,207Updated 3 weeks ago
- PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html☆2,052Updated last year