List of open-source TTS, voice cloning, and music generation models
☆186Apr 17, 2026Updated last week
Alternatives and similar repositories for awesome-ai-voice
Users who are interested in awesome-ai-voice are comparing it to the libraries listed below.
- A collection of small dash apps which I created for learning purposes. Some of them answer questions asked on the plotly forum. https://c…☆11Feb 8, 2024Updated 2 years ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆20Apr 10, 2025Updated last year
- UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)☆15Oct 27, 2023Updated 2 years ago
- Compute WER and SER for speech recognition evaluation☆27Mar 18, 2026Updated last month
- Decentralized AI inference for OpenClaw agents. Powered by Morpheus AI. Stake MOR, access Kimi K2.5 + 10 models, never run out of inferen…☆108Updated this week
- Marketplace ML experiment - training without backprop☆27Sep 9, 2025Updated 7 months ago
- ☆62Jan 8, 2025Updated last year
- This repository extends the mask editor in ComfyUI and supports a lasso method for applying masks☆14Jul 23, 2025Updated 9 months ago
- Benchmarking STT service TTFB and semantic WER for real-time AI applications☆57Mar 20, 2026Updated last month
- Find out how to use SchemaCrawler AI MCP Server☆24Updated this week
- 🛡️ Detect and respond to security threats in real-time with God-Eye, an AI-driven tool designed for privacy and local deployment on mult…☆39Updated this week
- OmniVoice TTS nodes for ComfyUI - Zero-shot multilingual text-to-speech with voice cloning, voice design, and multi-speaker dialogue☆353Updated this week
- ☆23Apr 29, 2025Updated last year
- ☆21Nov 4, 2024Updated last year
- Don't buy a mixer. Build one. OLMS: The Open Live Mixing System. Transforms a generic Mini-PC into a dedicated, professional Rack Digita…☆29Feb 18, 2026Updated 2 months ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆28Apr 23, 2024Updated 2 years ago
- ☆31Jan 2, 2026Updated 3 months ago
- One command to uninstall ALL Claw-family AI agents. Zero residue.☆70Mar 15, 2026Updated last month
- Sputnik DAO v2 utils☆10Jun 20, 2022Updated 3 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack☆28Sep 23, 2022Updated 3 years ago
- C++ version of the pyannote audio speaker diarization pipeline☆22Feb 14, 2024Updated 2 years ago
- An asynchronous voice agent for WhatsApp built with ElevenLabs, Twilio, and Hono, running on Cloudflare 🔥☆29Jun 12, 2025Updated 10 months ago
- An Opencode plugin that makes GLM & Big Pickle ultrathink at all times☆35Dec 20, 2025Updated 4 months ago
- SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)☆71Dec 23, 2025Updated 4 months ago
- Unofficial implementation of JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation (https://arxiv.org/abs/2310.1…☆32Jan 19, 2024Updated 2 years ago
- DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.☆55Sep 25, 2023Updated 2 years ago
- music demixing with the sliCQ Transform and PyTorch☆34Nov 10, 2023Updated 2 years ago
- ComfyUI_SparkTTS☆16Mar 10, 2025Updated last year
- ☆21Oct 9, 2024Updated last year
- A lightweight, user-friendly web editor for SillyTavern Presets.☆23Sep 22, 2025Updated 7 months ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- ☆10Oct 18, 2022Updated 3 years ago
- ☆12Mar 26, 2026Updated last month
- An ultra-minimalistic VSCode icon set for modern devs.☆20May 10, 2025Updated 11 months ago
- Colab notebooks for Next-gen Kaldi☆32Oct 12, 2025Updated 6 months ago
- A Model Context Protocol (MCP) integration that provides Claude Desktop with autonomous browser automation capabilities. This agent enabl…☆37Feb 5, 2026Updated 2 months ago
- MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, i…☆194Apr 13, 2026Updated 2 weeks ago
- PyTorch implementation of WaveFit (Google, 2022), one of the SOTA lightweight, fast speech vocoders.☆65Sep 8, 2025Updated 7 months ago
- xvector model on jtubespeech☆47Nov 5, 2023Updated 2 years ago