Jackson-Kang/Prosody-augmentation-for-Text-to-speech

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Jackson-Kang/Prosody-augmentation-for-Text-to-speech)

Jackson-Kang / Prosody-augmentation-for-Text-to-speech

Simple tool for speech dataset augmentation for modeling various prosodies.

☆14

Alternatives and similar repositories for Prosody-augmentation-for-Text-to-speech

Users that are interested in Prosody-augmentation-for-Text-to-speech are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Jackson-Kang / Korean-phoneme-dictionary-generator
View on GitHub
Korean phoneme dictionary generator for training Montreal Forced Aligner (MFA)
☆13Feb 27, 2021Updated 5 years ago
Jackson-Kang / VQVC-Pytorch
View on GitHub
An unofficial implementation of Vector Quantization Voice Conversion (VQVC).
☆29Apr 12, 2021Updated 5 years ago
p1an-lin-jung / wv_tts
View on GitHub
☆19Mar 22, 2024Updated 2 years ago
Jackson-Kang / Awesome-DL-based-Text-to-speech-Papers-and-Resources
View on GitHub
Various Text-to-speech (TTS) papers based on Deep-learning
☆14Feb 26, 2021Updated 5 years ago
jlian2 / Robust-Voice-Style-Transfer
View on GitHub
Demo for 2022 ICASSP
☆64Jun 14, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
mt-upc / ZeroSwot
View on GitHub
Pushing the Limits of Zero-shot End-to-End Speech Translation
☆25Dec 12, 2024Updated last year
Jackson-Kang / MFARunner
View on GitHub
A simple tool to easily use Montreal Forced Aligner. Also provide alignment(TextGrid) retrieved from ESD.
☆45May 25, 2023Updated 3 years ago
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
jefflai108 / Unsupervised-TTS
View on GitHub
☆42Mar 25, 2022Updated 4 years ago
revsic / torch-diffusion-wavegan
View on GitHub
Parallel waveform generation with DiffusionGAN
☆17Mar 26, 2022Updated 4 years ago
chomeyama / HN-UnifiedSourceFilterGAN
View on GitHub
☆88Nov 1, 2022Updated 3 years ago
xcmyz / CLONE
View on GitHub
☆20Jul 13, 2022Updated 4 years ago
ashi-ta / speechGLUE
View on GitHub
SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.
☆13Jun 2, 2023Updated 3 years ago
CODEJIN / Speaker_Embedding_Torch
View on GitHub
PyTorch based speaker embedding model
☆16Apr 13, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
wenet-e2e / WeSpeech-AI
View on GitHub
Open Source Speech/Text Data on AI
☆19Sep 13, 2022Updated 3 years ago
polvanrijn / VoiceMe
View on GitHub
Repository for the paper: VoiceMe: Personalized voice generation in TTS
☆125Apr 29, 2022Updated 4 years ago
LAION-AI / Text-to-speech
View on GitHub
☆61Nov 4, 2023Updated 2 years ago
HGU-DLLAB / Korean-FastSpeech2-Pytorch
View on GitHub
Implementation of Korean FastSpeech2
☆215Jan 29, 2023Updated 3 years ago
walker-hyf / NCSSD
View on GitHub
Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)
☆61Nov 1, 2024Updated last year
rishikksh20 / AdaSpeech2
View on GitHub
AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data
☆70Aug 31, 2021Updated 4 years ago
ex3ndr / supervoice-hybrid
View on GitHub
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Aug 5, 2024Updated last year
JSALT-2022-SSL / superb-prosody
View on GitHub
☆31Jul 13, 2023Updated 3 years ago
rishikksh20 / Phone-Level-Mixture-Density-Network-for-TTS
View on GitHub
Rich Prosody Diversity Modelling with Phone-level Mixture Density Network
☆45Dec 1, 2021Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
ttaoREtw / semi-tts
View on GitHub
Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation
☆39Jul 16, 2020Updated 6 years ago
MTG / PodcastMix-inference
View on GitHub
☆32Jan 6, 2022Updated 4 years ago
unilight / LDNet
View on GitHub
Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"
☆68Dec 13, 2021Updated 4 years ago
X-LANCE / UniCATS-CTX-txt2vec
View on GitHub
[AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS
☆64Nov 18, 2024Updated last year
nc-ai / speech
View on GitHub
☆17Aug 27, 2025Updated 11 months ago
fabianoluzbr / neural-g2p-portuguese
View on GitHub
Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…
☆19Jun 14, 2021Updated 5 years ago
anton-kashkin / hifi_vc
View on GitHub
☆25Jan 24, 2023Updated 3 years ago
MiscellaneousStuff / PhoneLM
View on GitHub
(R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.
☆48Sep 4, 2023Updated 2 years ago
ditto-tts / ditto-tts.github.io
View on GitHub
Official Demo Page for DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer
☆38Feb 17, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
idebtor / JoyAI
View on GitHub
모두를 위한 인공지능의 활용(GEK10109-Joyful AI for ALL)
☆70Mar 3, 2024Updated 2 years ago
patrickvonplaten / audio-gen-dreambooth
View on GitHub
☆23Jun 13, 2023Updated 3 years ago
tarepan / VoiceConversionLab
View on GitHub
Collect Voice Conversion researches
☆97Updated this week
keonlee9420 / Robust_Fine_Grained_Prosody_Control
View on GitHub
PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis
☆41Feb 20, 2022Updated 4 years ago
sarulab-speech / multi-speaker-dgp
View on GitHub
Official implementation of DGP-based multi-speaker speech synthesis with PyTorch
☆24Mar 23, 2021Updated 5 years ago
callee2006 / MachineLearning
View on GitHub
Deep learning and machine learning example codes for practice
☆18Jan 21, 2020Updated 6 years ago
rishikksh20 / NU-Wave2-pytorch
View on GitHub
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]
☆25Jul 5, 2022Updated 4 years ago