coqui-ai/coqui-voice-pack

🐸 Coqui Dialogue Audio Pack contains more than 2,000 audio files of synthetic human voice-over dialogue created specifically for video games. The pack covers more than 30 different voices, both male and female, and all of the files can be used royalty-free for commercial purposes.

☆46

Alternatives and similar repositories for coqui-voice-pack

Users that are interested in coqui-voice-pack are comparing it to the libraries listed below.

nii-yamagishilab / speaker_sex_attribute_privacy
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15 · Nov 30, 2022 · Updated 3 years ago
NeonGeckoCom / neon-tts-plugin-coqui
Coqui AI TTS plugin
☆85 · Jul 2, 2025 · Updated last year
WangHelin1997 / Aty-TTS
Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech
☆11 · May 14, 2025 · Updated last year
natlamir / DINet-UI
Windows Forms user interface for making lip sync videos with DINet and OpenFace
☆26 · Oct 14, 2023 · Updated 2 years ago
lukassteinwender / avatair
A pipeline to generate user-preferred photo-realistic avatars using stable-diffusion and bayesian-optimization.
☆18 · Jun 12, 2026 · Updated last month
ryanrudes / YTTTS
The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions
☆53 · Apr 1, 2021 · Updated 5 years ago
jasonppy / syllable-discovery
Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model
☆35 · Aug 27, 2023 · Updated 2 years ago
erogol / ngi
Fast trigram-indexed regex search for codebases, 2-6x faster than ripgrep
☆20 · Mar 24, 2026 · Updated 4 months ago
A7ocin / FaceAnimator
☆12 · Aug 22, 2017 · Updated 8 years ago
chenxwh / seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
☆11 · Sep 13, 2023 · Updated 2 years ago
Algomancer / The-Daily-Train
Training Models Daily
☆16 · Dec 19, 2023 · Updated 2 years ago
sbrunk / scalajs-tfjs
Scala.js bindings for TensorFlow.js. Train and deploy ML models in your browser with Scala.
☆17 · Aug 12, 2018 · Updated 7 years ago
1rgs / token-trekker-rs
☆13 · Mar 22, 2023 · Updated 3 years ago
josh-truong / AI-Upscaler
Upscale images or videos using ESRGAN. TL;DR Converts low res to high res image quality.
☆10 · Aug 4, 2022 · Updated 3 years ago
yamathcy / ISMIR2022J-POP
Supplementary Materials of ISMIR 2022 paper "Analysis and detection of singing techniques in repertoires of J-POP solo singers" by Yuya Y…
☆23 · Apr 23, 2024 · Updated 2 years ago
aklos / gpt3-personal-assistant
Interact with GPT-3 through speech
☆12 · Dec 12, 2022 · Updated 3 years ago
OpenVoiceOS / ovos-plugin-manager
Plugin manager for OpenVoiceOS, STT/TTS/Wakewords that can be used anywhere
☆14 · Updated this week
morelen17 / tts-papers
List of papers about TTS
☆10 · Dec 16, 2017 · Updated 8 years ago
sushant-t / tts-trainer
Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…
☆30 · May 27, 2023 · Updated 3 years ago
kairess / colorizer
Colorize black and white (grayscale) image (or video) to colorful color with OpenCV (DNN module)
☆15 · Oct 28, 2018 · Updated 7 years ago
ftyers / commonvoice-utils
Linguistic processing for Common Voice
☆59 · Jan 18, 2024 · Updated 2 years ago
getmetal / chatbot
Deploy a "chat with your data" bot in minutes.
☆20 · Mar 16, 2024 · Updated 2 years ago
coqui-ai / STT-models
Open models for Coqui STT
☆153 · May 9, 2023 · Updated 3 years ago
thorstenMueller / cTTS
TTS Client for Coqui TTS server
☆13 · Jan 7, 2023 · Updated 3 years ago
boximator / boximator.github.io
Website code for Boximator: Generating Rich and Dynamic Motions for Video Synthesis
☆17 · Feb 19, 2024 · Updated 2 years ago
yl4579 / PitchExtractor
Deep Neural Pitch Extractor for Voice Conversion and TTS Training
☆151 · Aug 22, 2022 · Updated 3 years ago
oatsu-gh / SimpleEnunu
Another ENUNU for enthusiasts and developers, easy to catch up with NNSVS
☆14 · Dec 12, 2025 · Updated 7 months ago
harveenchadha / bol
Open Source Speech Inferencing Library for Indic Languages
☆12 · Apr 11, 2022 · Updated 4 years ago
souvikg544 / TTS_Data_Maker
Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…
☆28 · Mar 14, 2023 · Updated 3 years ago
CherokeeLanguage / cherokee-audio-data
Cherokee Audio data
☆11 · Dec 24, 2023 · Updated 2 years ago
lstrgar / ss-phoneme-seg
Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technology…
☆55 · Nov 4, 2022 · Updated 3 years ago
motazsaad / ara-pronunciation-tool
A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …
☆15 · Sep 5, 2017 · Updated 8 years ago
featherless-ai-integrations / featherless-deepdive
☆21 · Feb 11, 2025 · Updated last year
jhdeov / interlingual-MFA
Workflow for forced alignment between languages
☆25 · May 7, 2026 · Updated 2 months ago
chain-ml / council-writing-assistant
A demo and tutorial for Council that implements a research writing assistant.
☆11 · Nov 10, 2023 · Updated 2 years ago
promax204 / manual-chm
keep files
☆13 · Sep 22, 2017 · Updated 8 years ago
Archivoice / nnsvs-chinese-support
Hed and supporting files for Chinese NNSVS Dataset Creation
☆13 · Oct 14, 2025 · Updated 9 months ago
ajaybati / miipher2.0
Reimplementation of Miipher
☆30 · Aug 16, 2023 · Updated 2 years ago
interscript / rababa
Rababa, the diacritization library for Arabic and Hebrew (Abjad scripts in general)
☆13 · May 1, 2025 · Updated last year