Azure-Samples / cognitive-services-speech-sdkLinks
Sample code for the Microsoft Cognitive Services Speech SDK
☆3,231Updated this week
Alternatives and similar repositories for cognitive-services-speech-sdk
Users that are interested in cognitive-services-speech-sdk are comparing it to the libraries listed below
Sorting:
- Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.☆964Updated last week
- Microsoft Azure Cognitive Services Speech SDK for JavaScript☆308Updated this week
- This sample shows how to integrate the Azure Speech service into a sample React application. This sample shows design pattern examples fo…☆157Updated last year
- Azure OpenAI code resources for using gpt-4o-realtime capabilities.☆821Updated 3 weeks ago
- 微软 tts 文本转语音 音频下载☆907Updated 3 months ago
- Unified-Modal Speech-Text Pre-Training for Spoken Language Processing☆1,374Updated last year
- Python interface to the WebRTC Voice Activity Detector☆2,290Updated last year
- Welcome to the Microsoft Voice Assistant samples repository! Here you will find samples to help you get started building client applicati…☆117Updated last year
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆6,199Updated 3 weeks ago
- The repository for all Azure OpenAI Samples complementing the OpenAI cookbook.☆1,238Updated last week
- Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/☆817Updated last year
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germa…☆3,949Updated last year
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"☆2,052Updated last year
- Command line utility for forced alignment using Kaldi☆1,525Updated this week
- PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html☆2,150Updated 2 weeks ago
- A simple example implementation of the VoiceRAG pattern to power interactive voice generative AI experiences using RAG with Azure AI Sear…☆465Updated 3 weeks ago
- Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, …☆1,391Updated last month
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-speech☆360Updated last year
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries☆1,123Updated last month
- Vision AI Solution Accelerator☆432Updated last month
- Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime…☆6,583Updated this week
- OpenAI Whisper ASR Webservice API☆2,722Updated last week
- A PyTorch-based Speech Toolkit☆10,113Updated last week
- A python package to build AI-powered real-time audio applications☆1,352Updated 4 months ago
- ☆1,437Updated last year
- Examples of how to use or integrate DeepSpeech☆852Updated last year
- Offline speech recognition for Android with Vosk library.☆895Updated last year
- http://www.facegood.cc☆1,879Updated 2 years ago
- Offline Text To Speech synthesis for python☆2,368Updated this week
- Multilingual Automatic Speech Recognition with word-level timestamps and confidence☆2,493Updated 3 months ago