CookiePPP/cookietts

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CookiePPP/cookietts)

CookiePPP / cookietts

[Last Updated 2021] TTS from Cookie. Messy and experimental!

☆43

Alternatives and similar repositories for cookietts

Users that are interested in cookietts are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CookiePPP / podcast_rss_feeds
View on GitHub
List of Podcast Feeds using iTunes API and script to download 6,000,000~ hours of English speech.
☆31Apr 13, 2023Updated 3 years ago
revsic / torch-nansy
View on GitHub
Torch implementation of NANSY, Neural Analysis and Synthesis, arXiv:2110.14513
☆64Feb 13, 2023Updated 3 years ago
noicevice / awesome-voice-cloning
View on GitHub
☆64Feb 5, 2021Updated 5 years ago
adelacvg / diff-vits
View on GitHub
☆39Oct 1, 2023Updated 2 years ago
hecko-yes / tts-dataset-prompts
View on GitHub
Finally, some decent sample sentences
☆24Dec 3, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
innnky / glow-svc
View on GitHub
singing voice conversion based on glow-tts
☆12Aug 20, 2023Updated 2 years ago
WX-Wei / HarmoF0
View on GitHub
☆108Aug 23, 2024Updated last year
hcy71o / AutoVocoder
View on GitHub
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
☆71Dec 2, 2022Updated 3 years ago
MaxMax2016 / max-vc
View on GitHub
singing voice conversion without f0
☆23May 10, 2023Updated 3 years ago
MTG / Podcastmix
View on GitHub
PodcastMix A dataset for separating music and speech in podcasts.
☆44Aug 20, 2024Updated last year
hcy71o / MB-iSTFT-VITS-with-AutoVocoder
View on GitHub
Incorporating AutoVocoder to MB-iSTFT-VITS
☆47Dec 1, 2022Updated 3 years ago
BridgetteSong / ExpressiveTacotron
View on GitHub
This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…
☆74Sep 21, 2022Updated 3 years ago
huutuongtu / Lightvoc
View on GitHub
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18May 17, 2024Updated 2 years ago
MTG / PodcastMix-inference
View on GitHub
☆32Jan 6, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
chomeyama / SiFiGAN
View on GitHub
Official implementation of the source-filter HiFiGAN vocoder
☆275Jul 29, 2023Updated 3 years ago
Many0therFunctions / MaskGCT-Text-To-Semantic-Finetune
View on GitHub
This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …
☆13Dec 4, 2024Updated last year
okio-ai / nendo_plugin_musicgen
View on GitHub
Nendo plugin for MusicGen: A state-of-the-art controllable text-to-music model (by Meta Research)
☆17Mar 19, 2024Updated 2 years ago
ga642381 / RobustVC
View on GitHub
**ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…
☆24Sep 27, 2022Updated 3 years ago
Pokerole-Software-Development / Pokerole-Obsidian-SRD
View on GitHub
A System Reference Document for Obsidian. Maintainer: Willowlark
☆11Feb 16, 2026Updated 5 months ago
Aratako / CALM-DACVAE
View on GitHub
An attempt to reproduce CALM (Continuous Audio Language Models) using DACVAE as the audio VAE.
☆18Feb 20, 2026Updated 5 months ago
tuanh123789 / AdaSpeech
View on GitHub
An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"
☆98Jun 7, 2022Updated 4 years ago
Verssae / flask-tacotron2-tts-web-app
View on GitHub
flask+tornado based NVIDIA tacotron2+waveglow tts web app
☆28May 25, 2023Updated 3 years ago
oatsu-gh / utau_renderer_with_diff_svc
View on GitHub
Render wav and convert it with [Diff-SVC](https://github.com/prophesier/diff-svc) model
☆10Aug 24, 2025Updated 11 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
MaxMax2016 / Glow-SVC
View on GitHub
4G GPU & 10 Minutes for train
☆12Aug 9, 2023Updated 2 years ago
ubisoft / ubisoft-laforge-daft-exprt
View on GitHub
Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
☆127Apr 8, 2023Updated 3 years ago
ishine / PnG-BERT
View on GitHub
PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS
☆24Jan 29, 2022Updated 4 years ago
Yangyangii / TPGST-Tacotron
View on GitHub
Google's TPGST reimplementation.
☆34Dec 11, 2019Updated 6 years ago
liuhuang31 / HiFTNet-sr
View on GitHub
HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz
☆24Jan 2, 2024Updated 2 years ago
Chinch-Bug / clangen-genemod
View on GitHub
Clangen mod incorporating inheritable cat genetics
☆16Updated this week
rhoposit / multilingual_VQVAE
View on GitHub
☆37May 8, 2021Updated 5 years ago
innnky / diff-svc
View on GitHub
An Implementation of Singing Voice Conversion Based on Diffsinger
☆73Feb 20, 2023Updated 3 years ago
neosapience / editts
View on GitHub
Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)
☆122Jan 24, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
GOLEM-lab / fandom-wiki
View on GitHub
Extraction of structured and unstructured information from fandom.com pages
☆29Feb 22, 2025Updated last year
rishikksh20 / NU-Wave2-pytorch
View on GitHub
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]
☆25Jul 5, 2022Updated 4 years ago
hs-oh-prml / DurFlexEVC
View on GitHub
☆82Jan 22, 2025Updated last year
Jackson-Kang / Prosody-augmentation-for-Text-to-speech
View on GitHub
Simple tool for speech dataset augmentation for modeling various prosodies.
☆14Jan 14, 2021Updated 5 years ago
yfyeung / DS-WED
View on GitHub
[ICASSP 2026] Official code for "Measuring Prosody Diversity in Zero-Shot TTS: A New Metric, Benchmark, and Exploration"
☆17Apr 16, 2026Updated 3 months ago
adelacvg / DPTTS
View on GitHub
An AR+AR TTS attempt.
☆18Jan 13, 2025Updated last year
neosapience / mlp-singer
View on GitHub
Official implementation of MLP Singer: Towards Rapid Parallel Korean Singing Voice Synthesis (IEEE MLSP 2021)
☆118Feb 24, 2022Updated 4 years ago