Azure-Samples / cognitive-services-speech-sdkLinks
Sample code for the Microsoft Cognitive Services Speech SDK
☆3,352Updated last week
Alternatives and similar repositories for cognitive-services-speech-sdk
Users that are interested in cognitive-services-speech-sdk are comparing it to the libraries listed below
Sorting:
- Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.☆994Updated last month
- This sample shows how to integrate the Azure Speech service into a sample React application. This sample shows design pattern examples fo…☆160Updated 2 years ago
- The repository for all Azure OpenAI Samples complementing the OpenAI cookbook.☆1,291Updated last week
- A simple example implementation of the VoiceRAG pattern to power interactive voice generative AI experiences using RAG with Azure AI Sear…☆526Updated 2 weeks ago
- Welcome to the Microsoft Voice Assistant samples repository! Here you will find samples to help you get started building client applicati…☆122Updated 2 years ago
- Python interface to the WebRTC Voice Activity Detector☆2,405Updated last year
- The official Python SDK for the ElevenLabs API.☆2,800Updated this week
- Examples of how to use or integrate DeepSpeech☆857Updated 2 years ago
- Vision AI Solution Accelerator☆435Updated 6 months ago
- ☆2,430Updated 10 months ago
- 微软 tts 文本转语音 音频下载☆922Updated 8 months ago
- PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html☆2,188Updated 2 months ago
- Chinese Mandarin tts text-to-speech 中文 (普通话) 语音 合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei …☆483Updated 3 years ago
- A simple web application for a OpenAI-enabled document search. This repo uses Azure OpenAI Service for creating embeddings vectors from d…☆853Updated last year
- Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, …☆1,562Updated last month
- GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code☆2,651Updated last year
- Unified-Modal Speech-Text Pre-Training for Spoken Language Processing☆1,414Updated last year
- Command line utility for forced alignment using Kaldi☆1,676Updated 2 weeks ago
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries☆1,203Updated 4 months ago
- KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-…☆526Updated last year
- Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key☆9,455Updated 3 months ago
- Sample code for a simple web chat experience through Azure OpenAI, including Azure OpenAI On Your Data.☆1,909Updated last week
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆7,512Updated last week
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech☆7,746Updated last year
- A copilot sample that uses python to ground the copilot responses in company data.☆285Updated last year
- A Solution Accelerator for the RAG pattern running in Azure, using Azure AI Search for retrieval and Azure OpenAI large language models t…☆1,133Updated this week
- This is a repo for cognitive services REST API samples in 4 languages: C#, Java, Node.js, and Python.☆239Updated last year
- Use D-ID's live streaming API to stream a talking presenter☆204Updated 3 weeks ago
- http://www.facegood.cc☆1,899Updated 2 years ago
- Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!☆1,216Updated last year