Azure-Samples / cognitive-services-speech-sdk
Sample code for the Microsoft Cognitive Services Speech SDK
☆3,123Updated this week
Alternatives and similar repositories for cognitive-services-speech-sdk:
Users that are interested in cognitive-services-speech-sdk are comparing it to the libraries listed below
- Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.☆951Updated 2 weeks ago
- Microsoft Azure Cognitive Services Speech SDK for JavaScript☆288Updated this week
- Welcome to the Microsoft Voice Assistant samples repository! Here you will find samples to help you get started building client applicati…☆112Updated last year
- This sample shows how to integrate the Azure Speech service into a sample React application. This sample shows design pattern examples fo…☆150Updated last year
- The sample app and documentation of the Microsoft Speech Devices SDK.☆18Updated 3 years ago
- An unofficial PyTorch implementation of the audio LM VALL-E☆2,989Updated last year
- Unified-Modal Speech-Text Pre-Training for Spoken Language Processing☆1,320Updated 11 months ago
- Vision AI Solution Accelerator☆425Updated 2 months ago
- The repository for all Azure OpenAI Samples complementing the OpenAI cookbook.☆1,182Updated last month
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone☆953Updated 4 months ago
- Azure OpenAI code resources for using gpt-4o-realtime capabilities.☆791Updated this week
- Chinese Mandarin tts text-to-speech 中文 (普通话) 语音 合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei …☆471Updated 2 years ago
- Sample to envision intelligent apps with Microsoft's Copilot stack for AI-infused product experiences.☆739Updated 8 months ago
- This is a repo for cognitive services REST API samples in 4 languages: C#, Java, Node.js, and Python.☆236Updated last year
- Riva Python client API and CLI utils☆88Updated this week
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.☆4,572Updated 11 months ago
- Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node☆9,042Updated 2 weeks ago
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries☆1,007Updated 6 months ago
- OpenAI Whisper ASR Webservice API☆2,453Updated last month
- KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-…☆501Updated last year
- PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html☆2,107Updated last year
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…☆7,094Updated last week
- Command Line tool and Windows application for document translation, a local interface to the Azure Document Translation service for Windo…☆163Updated 3 months ago
- A book about Text-to-Speech (TTS) in Chinese.☆595Updated 2 years ago
- Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.☆314Updated last month
- Samples for working with Azure OpenAI Service☆401Updated last year
- AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head☆10,113Updated 8 months ago
- Multilingual Voice Understanding Model☆5,025Updated 2 months ago
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆9,049Updated this week
- Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, …☆1,233Updated 2 months ago