rany2/edge-tts

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/rany2/edge-tts)

rany2 / edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

☆11,540

Alternatives and similar repositories for edge-tts

Users that are interested in edge-tts are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

2noise / ChatTTS
View on GitHub
A generative speech model for daily dialogue.
☆39,651Apr 10, 2026Updated 3 months ago
fishaudio / fish-speech
View on GitHub
SOTA Open Source TTS
☆31,332Jun 9, 2026Updated last month
RVC-Boss / GPT-SoVITS
View on GitHub
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
☆59,977Jul 13, 2026Updated last week
coqui-ai / TTS
View on GitHub
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆45,783Aug 16, 2024Updated last year
FunAudioLLM / CosyVoice
View on GitHub
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆22,292May 25, 2026Updated last month
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
myshell-ai / MeloTTS
View on GitHub
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
☆7,544Dec 24, 2024Updated last year
modelscope / FunASR
View on GitHub
Open-source speech recognition toolkit for training, inference, streaming ASR, VAD, punctuation, speaker diarization pipelines, and OpenA…
☆19,364Updated this week
netease-youdao / EmotiVoice
View on GitHub
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
☆8,491Aug 13, 2024Updated last year
suno-ai / bark
View on GitHub
🔊 Text-Prompted Generative Audio Model
☆39,201Aug 19, 2024Updated last year
SWivid / F5-TTS
View on GitHub
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
☆14,981Jul 5, 2026Updated 2 weeks ago
SYSTRAN / faster-whisper
View on GitHub
Faster Whisper transcription with CTranslate2
☆24,400Nov 19, 2025Updated 8 months ago
FunAudioLLM / SenseVoice
View on GitHub
Open-source SenseVoiceSmall model for Mandarin, Cantonese, English, Japanese, and Korean ASR, language ID, emotion recognition, and audio…
☆8,902Updated this week
myshell-ai / OpenVoice
View on GitHub
Instant voice cloning by MIT and MyShell. Audio foundation model.
☆36,984Apr 19, 2025Updated last year
jianchang512 / ChatTTS-ui
View on GitHub
一个简单的本地网页界面，使用ChatTTS将文字合成为语音，同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with su…
☆7,622Jun 14, 2026Updated last month
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
jianchang512 / pyvideotrans
View on GitHub
Translate the video from one language to another and embed dubbing & subtitles.
☆18,386Updated this week
songquanpeng / one-api
View on GitHub
LLM API 管理 & 分发系统，支持 OpenAI、Azure、Anthropic Claude、Google Gemini、DeepSeek、字节豆包、ChatGLM、文心一言、讯飞星火、通义千问、360 智脑、腾讯混元等主流模型，统一 API 适配，可用于 key …
☆35,836Jan 9, 2026Updated 6 months ago
OpenTalker / SadTalker
View on GitHub
[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
☆13,959Jun 26, 2024Updated 2 years ago
k2-fsa / sherpa-onnx
View on GitHub
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime…
☆13,671Updated this week
Huanshere / VideoLingo
View on GitHub
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切…
☆17,792Jul 2, 2026Updated 2 weeks ago
openai / whisper
View on GitHub
Robust Speech Recognition via Large-Scale Weak Supervision
☆105,288Apr 15, 2026Updated 3 months ago
huggingface / parler-tts
View on GitHub
Inference and training library for high-quality TTS models.
☆5,582Dec 10, 2024Updated last year
open-mmlab / Amphion
View on GitHub
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…
☆9,957Mar 25, 2026Updated 3 months ago
labring / FastGPT
View on GitHub
FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data process…
☆29,040Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
jianchang512 / clone-voice
View on GitHub
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具，使用你的音色或任意声音来录制音频
☆8,980Aug 29, 2025Updated 10 months ago
lobehub / lobehub
View on GitHub
🤯 LobeHub is your Chief Agent Operator, organizing your agents into 7×24 operations by hiring, scheduling, and reporting on your entire …
☆80,571Updated this week
ChatGPTNextWeb / NextChat
View on GitHub
✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows
☆88,517Jul 6, 2026Updated 2 weeks ago
PaddlePaddle / PaddleSpeech
View on GitHub
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text fronten…
☆12,649Jun 21, 2026Updated last month
index-tts / index-tts
View on GitHub
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
☆22,015Jul 14, 2026Updated last week
lipku / LiveTalking
View on GitHub
Real time interactive streaming digital human
☆8,450Updated this week
langgenius / dify
View on GitHub
Build Agentic workflows, RAG pipelines, with rich AI model and tool support on one collaborative workspace. Deploy on cloud, VPC, or self…
☆149,503Updated this week
m-bain / whisperX
View on GitHub
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆23,143Jul 13, 2026Updated last week
OpenTalker / video-retalking
View on GitHub
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
☆7,269Aug 5, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
SparkAudio / Spark-TTS
View on GitHub
Spark-TTS Inference Code
☆11,000Apr 9, 2025Updated last year
modelscope / FunClip
View on GitHub
FunASR-powered video transcription, subtitle generation, and LLM-assisted clipping tool with a local Gradio UI.
☆6,020Updated this week
datalab-to / surya
View on GitHub
OCR, layout analysis, reading order, table recognition in 90+ languages
☆21,123Updated this week
Migushthe2nd / MsEdgeTTS
View on GitHub
A simple Azure Speech Service module that uses the Microsoft Edge Read Aloud API. https://www.npmjs.com/package/msedge-tts
☆335Jul 9, 2026Updated last week
Comfy-Org / ComfyUI
View on GitHub
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
☆121,540Updated this week
chidiwilliams / buzz
View on GitHub
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
☆20,257Updated this week
travisvn / openai-edge-tts
View on GitHub
Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs
☆1,993Jul 1, 2025Updated last year