wildminder/awesome-ai-voice

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wildminder/awesome-ai-voice)

wildminder / awesome-ai-voice

List of open-source TTS, voice cloning, and music generation models

☆362

Alternatives and similar repositories for awesome-ai-voice

Users that are interested in awesome-ai-voice are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sarulab-speech / xvector_jtubespeech
View on GitHub
xvector model on jtubespeech
☆47Nov 5, 2023Updated 2 years ago
Btr4k / bugbounty-agent
View on GitHub
Automated bug bounty reconnaissance and scanning agent
☆45Jun 20, 2026Updated 2 weeks ago
0417keito / UTAUTAI
View on GitHub
UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)
☆16Oct 27, 2023Updated 2 years ago
pengzhendong / compute-wer
View on GitHub
Compute WER and SER for speech recognition evaluation
☆26Jun 6, 2026Updated last month
all-my-frontend-mini-projects / Ping-coming-soon-page_frontend_project
View on GitHub
Get ready for the launch of Ping with this sleek and modern coming soon page! Users can view the optimal layout for the site depending on…
☆10Oct 10, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
maxmekiska / micro-templates
View on GitHub
Repository to host micro service implementation patterns.
☆14Jun 25, 2025Updated last year
sarulab-speech / audio-foundation-model-dataset
View on GitHub
☆65Jan 8, 2025Updated last year
profbernardoj / everclaw-community-branches
View on GitHub
Decentralized AI inference for OpenClaw agents. Powered by Morpheus AI. Stake MOR, access Kimi K2.5 + 10 models, never run out of inferen…
☆109Jun 20, 2026Updated 2 weeks ago
zile42O / ads-tgbot
View on GitHub
Telegram bot for advertising (forwarding/sending) messages to all joined groups automatically by interval
☆12Oct 16, 2023Updated 2 years ago
unconv / calorieapp
View on GitHub
GPT-4o Powered Calorie Detecor
☆18May 29, 2024Updated 2 years ago
Jellypod-Inc / speech-sdk
View on GitHub
Universal Text-To-Speech TypeScript SDK with Multi-Provider Support.
☆43Updated this week
Bradleykingz / automated-postgres-backups-with-node
View on GitHub
Automatically backing up your Postgres database using NodeJS
☆13Nov 14, 2020Updated 5 years ago
vadimcro / VKRiez-Edge
View on GitHub
VKriez Edge Preprocessors nodes for ComfyUI
☆16Mar 18, 2025Updated last year
unilight / jatts
View on GitHub
JATTS: A modern, research-oriented Japanese Text-to-speech Open-sourced Toolkit
☆43Mar 13, 2026Updated 3 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
elder-plinius / NATURALIS-FUTURA
View on GitHub
latent encyclopedia
☆52Mar 21, 2026Updated 3 months ago
thuytv-gl / fabric-CJK-vertical
View on GitHub
☆10Jan 18, 2024Updated 2 years ago
nithin-developer / Twitter-Scraper-Telegram-Bot
View on GitHub
The Twitter Scraper Telegram Bot is a Python-based bot developed to scrape tweets from influencers' Twitter accounts and send them via Te…
☆15Jun 25, 2023Updated 3 years ago
Mellow-Artificial-Intelligence / openextract
View on GitHub
Extract structured data from documents, images, audio, and video using LLMs.
☆18Jun 23, 2026Updated 2 weeks ago
airscholar / Japan-visa-data-engineering
View on GitHub
This project provides an end-to-end data processing and visualization of visa numbers in Japan using PySpark and Plotly. The spark cluste…
☆11Oct 11, 2023Updated 2 years ago
luqasn / aws-sandbox
View on GitHub
Transparent sandbox for integration testing against AWS services. Test your infrastructure without changes to your Terraform files or you…
☆12Oct 26, 2023Updated 2 years ago
elder-plinius / ENTHEA
View on GitHub
real-time psychedelic visual synthesizer and pro-grade music visualizer
☆172Jun 5, 2026Updated last month
pipecat-ai / stt-benchmark
View on GitHub
Benchmarking STT service TTFB and semantic WER for real-time AI applications
☆84Jun 22, 2026Updated 2 weeks ago
jeka-kiselyov / dimeshift-desktop
View on GitHub
DimeShift desktop application
☆17Feb 25, 2017Updated 9 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
10h30 / kazewp
View on GitHub
KazeWP is a simple and flexible tool for managing multiple WordPress sites behind a Caddy reverse proxy server. Built with Docker and Bas…
☆17Apr 28, 2025Updated last year
ongxuanhong / de02-pyspark-optimization
View on GitHub
☆14Mar 11, 2023Updated 3 years ago
thuytv-scuti / fabric-CJK-vertical
View on GitHub
☆13Jan 6, 2022Updated 4 years ago
waheeb71 / WiFi-Hacking-Tool
View on GitHub
WiFi Hacking Tool is a powerful Python-based tool designed for ethical hacking, network analysis, and Wi-Fi penetration testing. It provi…
☆17May 9, 2026Updated 2 months ago
KathyReid / opensource-voice-tools
View on GitHub
A repo listing known open source voice tools, ordered by where they sit in the voice stack
☆28Sep 23, 2022Updated 3 years ago
leohuang2013 / pyannote-audio_speaker-diarization_cpp
View on GitHub
C++ version of pyannote audio speaker diarizaiton pipeline
☆22Feb 14, 2024Updated 2 years ago
Open-Live-Mixing-System-OLMS / Open-Live-Mixing-System
View on GitHub
Don't buy a mixer. Build one. OLMS: The Open Live Mixing System. Transforms a generic Mini-PC into a dedicated, professional Rack Digita…
☆34Feb 18, 2026Updated 4 months ago
EsadCetiner / Secure-Nginx-Config
View on GitHub
Fast and Secure by default Nginx configuration template
☆25Jan 6, 2026Updated 6 months ago
Q42 / fabricjs-opentypejs-demo
View on GitHub
Demo of fabricjs rendering fonts using opentypejs
☆21Sep 30, 2021Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
AmphionTeam / SpeechJudge
View on GitHub
SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)
☆77Dec 23, 2025Updated 6 months ago
unconv / gpt4v-examples
View on GitHub
Example use cases for the GPT-4 Vision API
☆19Nov 26, 2023Updated 2 years ago
mwanago / nestjs-dockerized
View on GitHub
☆17Mar 10, 2023Updated 3 years ago
Aratako / Irodori-TTS
View on GitHub
A Flow Matching-based Text-to-Speech Model with Emoji-driven Style Control
☆1,014Jun 4, 2026Updated last month
kturung / Langgraph-Multi-Agent-HITL-Form
View on GitHub
☆23Mar 2, 2025Updated last year
civen-cn / ComfyUI_SparkTTS
View on GitHub
ComfyUI_SparkTTS
☆16Mar 10, 2025Updated last year
sevagh / xumx-sliCQ
View on GitHub
music demixing with the sliCQ Transform and PyTorch
☆33Nov 10, 2023Updated 2 years ago