gooofy/zerovox

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/gooofy/zerovox)

gooofy / zerovox

zero-shot realtime TTS system, fully offline, free and open source

☆56

Alternatives and similar repositories for zerovox

Users that are interested in zerovox are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

iamanigeeit / present
View on GitHub
☆14Aug 19, 2024Updated last year
ajd12342 / paraspeechclap
View on GitHub
Codebase for 'ParaSpeechCLAP: A Dual-Encoder Speech-Text Model for Rich Stylistic Language-Audio Pretraining'
☆23Jun 20, 2026Updated last month
alefiury / SE-R-2022-SER-Track
View on GitHub
Code for the winning solution in the SE&R 2022 Challenge - SER track.
☆16Mar 28, 2023Updated 3 years ago
BiSinger-SVS / BiSinger
View on GitHub
Bilingual Singing Voice Synthesis
☆18Mar 25, 2024Updated 2 years ago
idiap / knn-tts
View on GitHub
Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model
☆36Apr 29, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Tikai7 / DiTTO-TTS
View on GitHub
DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors
☆39Feb 11, 2025Updated last year
babe269 / performant
View on GitHub
A toolset for easy formant extraction and visualization from wav files and TTS models
☆33Sep 2, 2022Updated 3 years ago
bagustris / s3prl-ser
View on GitHub
S3PRL for Speech Emotion Recognition (see s3prl > downstream)
☆15Feb 28, 2026Updated 5 months ago
samsad35 / code-ancogen
View on GitHub
[ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
☆14Mar 11, 2025Updated last year
arnabdas8901 / StarGAN-VC_PlusPlus
View on GitHub
☆11Aug 11, 2023Updated 2 years ago
Ereboas / TacoLM
View on GitHub
☆19May 2, 2024Updated 2 years ago
declare-lab / HyperTTS
View on GitHub
☆40Apr 15, 2024Updated 2 years ago
andimarafioti / nano-parakeet
View on GitHub
Pure-PyTorch Parakeet TDT inference
☆51Mar 10, 2026Updated 4 months ago
reppy4620 / x-vits
View on GitHub
☆14Aug 1, 2025Updated 11 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
avryhof / speech_recognition
View on GitHub
Speech recognition module for Python, supporting several engines and APIs, online and offline.
☆13Mar 9, 2022Updated 4 years ago
adelacvg / diff-vits
View on GitHub
☆39Oct 1, 2023Updated 2 years ago
mikex86 / DeepSpeech-Java-Bindings
View on GitHub
Java Bindings for the C++ library DeepSpeech
☆10Jun 4, 2020Updated 6 years ago
hbwu-ntu / EmoCtrlTTS-Eval
View on GitHub
☆19Aug 23, 2024Updated last year
ductuantruong / speaker_age_estimation_ssl_study
View on GitHub
[APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models
☆14Oct 19, 2022Updated 3 years ago
42io / tflite_kws
View on GitHub
☆13May 1, 2026Updated 2 months ago
yucefsourani / albasheer-electronic-quran-browser
View on GitHub
Albasheer electronic quran browser
☆24Jan 17, 2026Updated 6 months ago
shinhyeokoh / rwen
View on GitHub
☆14Jun 16, 2023Updated 3 years ago
utter-project / mHuBERT-147-scripts
View on GitHub
Collection of scripts from mHuBERT-147.
☆35Nov 19, 2024Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
daanzu / wenet_stt_python
View on GitHub
☆33Nov 27, 2021Updated 4 years ago
yuhangear / wenet-android
View on GitHub
☆13Oct 27, 2021Updated 4 years ago
ikegami-yukino / sengiri
View on GitHub
Yet another sentence-level tokenizer for the Japanese text
☆24Nov 27, 2025Updated 8 months ago
hcy71o / SC-CNN
View on GitHub
SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems
☆39Nov 1, 2023Updated 2 years ago
TigreGotico / chatterbox-onnx
View on GitHub
chatterbox TTS + Voice Clone using onnx
☆28Jul 20, 2026Updated last week
hongfeixue / StutteringSpeechChallenge
View on GitHub
SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge
☆12Jun 11, 2024Updated 2 years ago
primepake / dac_vae
View on GitHub
Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder
☆38Aug 30, 2025Updated 11 months ago
Scicom-AI-Enterprise-Organization / Multilingual-TTS
View on GitHub
Building actual open source including dataset Multilingual TTS more than 150 languages with Voice Cloning.
☆56Jul 14, 2026Updated 2 weeks ago
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
talker93 / oneMinTTS
View on GitHub
Launch your speech synthesis within one minute.
☆12May 6, 2024Updated 2 years ago
lars76 / fastspeech2-clean
View on GitHub
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
☆18Aug 16, 2024Updated last year
vTAD2025-Challenge / vTAD
View on GitHub
☆17Oct 24, 2025Updated 9 months ago
WhissleAI / PromptingNemo
View on GitHub
All-in-one Speech Transcription
☆11Jun 5, 2026Updated last month
fkhannouf / BGUI
View on GitHub
BGUI stands for BOOPSI Graphical User Interface. BGUI is free GUI toolkit for the Amiga OS.
☆11Apr 29, 2025Updated last year
Mobile-Artificial-Intelligence / babylon
View on GitHub
Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…
☆39Apr 14, 2026Updated 3 months ago
cnaigithub / Auto_Tuning_Zeroshot_TTS_and_VC
View on GitHub
Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…
☆80May 29, 2023Updated 3 years ago