Azure-Samples / cognitive-services-speech-sdkLinks
Sample code for the Microsoft Cognitive Services Speech SDK
☆3,199Updated this week
Alternatives and similar repositories for cognitive-services-speech-sdk
Users that are interested in cognitive-services-speech-sdk are comparing it to the libraries listed below
Sorting:
- Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.☆959Updated 3 weeks ago
- Microsoft Azure Cognitive Services Speech SDK for JavaScript☆302Updated this week
- This sample shows how to integrate the Azure Speech service into a sample React application. This sample shows design pattern examples fo…☆155Updated last year
- Welcome to the Microsoft Voice Assistant samples repository! Here you will find samples to help you get started building client applicati…☆117Updated last year
- The sample app and documentation of the Microsoft Speech Devices SDK.☆18Updated 3 years ago
- Multilingual Automatic Speech Recognition with word-level timestamps and confidence☆2,425Updated 2 months ago
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆5,930Updated 2 months ago
- This is a repo for cognitive services REST API samples in 4 languages: C#, Java, Node.js, and Python.☆238Updated last year
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.☆4,605Updated last year
- A highly-customizable web-based client for Azure Bot Services.☆1,682Updated this week
- Unified-Modal Speech-Text Pre-Training for Spoken Language Processing☆1,363Updated last year
- Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node☆9,848Updated last month
- A nearly-live implementation of OpenAI's Whisper.☆2,936Updated this week
- Faster Whisper transcription with CTranslate2☆16,408Updated this week
- On-device wake word detection powered by deep learning☆4,147Updated last week
- Python interface to the WebRTC Voice Activity Detector☆2,254Updated 11 months ago
- We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new …☆1,292Updated last year
- Real time transcription with OpenAI Whisper.☆2,732Updated last month
- Whisper realtime streaming for long speech-to-text transcription and translation☆2,947Updated 5 months ago
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries☆1,059Updated last week
- Examples of how to use or integrate DeepSpeech☆851Updated last year
- Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/☆813Updated last year
- Chinese Mandarin tts text-to-speech 中文 (普通话) 语音 合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei …☆473Updated 3 years ago
- [ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching☆1,017Updated last week
- Microsoft Graph Communications Samples☆223Updated this week
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…☆7,633Updated this week
- Open Text to Speech Server☆1,057Updated last year
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germa…☆3,931Updated 11 months ago
- A new way for developers to exchange card content in a common and consistent way.☆1,842Updated last week
- Simple text to phones converter for multiple languages☆1,389Updated 8 months ago