WelkinYang/EMPHASIS-pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/WelkinYang/EMPHASIS-pytorch)

WelkinYang / EMPHASIS-pytorch

EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System

☆15

Alternatives and similar repositories for EMPHASIS-pytorch

Users that are interested in EMPHASIS-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

rsprouse / xray_microbeam_database
View on GitHub
Annotations and scripts for use with University of Wisconsin X-Ray Microbeam Speech Production Database (1994)
☆14Oct 8, 2020Updated 5 years ago
cnlinxi / tpse_tacotron2
View on GitHub
TPSE-GST Tacotron2
☆14May 1, 2019Updated 7 years ago
xinyal / Gan-Speech-Synthesis-Research
View on GitHub
This is part of code of a research on speech synthesizing for a low-resourced language: Gan, a Chinese dialect spoken primarily in Jiangx…
☆17Sep 5, 2016Updated 9 years ago
mutiann / neural-lexicon-reader
View on GitHub
Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge
☆21Jul 25, 2022Updated 4 years ago
r9y9 / kiritan_singing
View on GitHub
Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems.
☆28Dec 31, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
rafaelvalle / asrgen
View on GitHub
Attacking Speaker Recognition with Deep Generative Models
☆34Mar 24, 2023Updated 3 years ago
ZackHodari / tts_data_tools
View on GitHub
Data processing tools for preparing speech and labels for training TTS voices
☆29Aug 13, 2020Updated 5 years ago
MiniXC / phones
View on GitHub
A collection of utilities for handling IPA phones.
☆27Sep 24, 2023Updated 2 years ago
v-nhandt21 / MusicVoiceConversion
View on GitHub
Sing any popular song with your voice
☆11Jul 10, 2022Updated 4 years ago
noetits / ICE-Talk
View on GitHub
Interface for Controllable Expressive Talking Machine
☆40Sep 20, 2025Updated 10 months ago
yuan1615 / AdaVocoder
View on GitHub
Adaptive Vocoder for Custom Voice
☆61Sep 22, 2022Updated 3 years ago
ZackHodari / average_prosody
View on GitHub
Code for paper titled "Using generative modelling to produce varied intonation for speech synthesis" submitted to the Speech Synthesis Wo…
☆24Dec 8, 2019Updated 6 years ago
Connum / npm-pinyin2ipa
View on GitHub
Converts Mandarin Chinese pinyin notation to IPA (international phonetic alphabet) notation
☆19Nov 28, 2023Updated 2 years ago
candlewill / RawNet
View on GitHub
RawNet: Fast End-to-End Neural Vocoder
☆43May 29, 2019Updated 7 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
insunhwang89 / StyleVC
View on GitHub
☆33Jan 14, 2023Updated 3 years ago
evinpinar / wavenet_pytorch
View on GitHub
Wavenet pytorch implementation for text-to-speech
☆19Jul 19, 2023Updated 3 years ago
Yangyangii / TPGST-Tacotron
View on GitHub
Google's TPGST reimplementation.
☆34Dec 11, 2019Updated 6 years ago
huiw39 / ExtensibleTTS-PyTorch
View on GitHub
An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery
☆26Jun 24, 2019Updated 7 years ago
babua / TTSDatasetRecorder
View on GitHub
A simple app for recording speech datasets.
☆26Jun 27, 2022Updated 4 years ago
hhguo / WaveRNN
View on GitHub
Based on https://github.com/fatchord/WaveRNN
☆24May 3, 2020Updated 6 years ago
Respaired / RiFornet_Vocoder
View on GitHub
a Neural Vocoder supporting Ring Attention, Conformer and NSF.
☆25Aug 1, 2025Updated 11 months ago
jgarciapueyo / MelNet-SpeechGeneration
View on GitHub
Implementation of MelNet in PyTorch to generate high-fidelity audio samples
☆25Sep 16, 2020Updated 5 years ago
LAION-AI / scaled-echo-tts
View on GitHub
Scaled diffusion transformer for text-to-speech synthesis (DiT + T5Gemma2 conditioning, TorchTitan & Megatron backends, tested up to 1024…
☆24Mar 29, 2026Updated 3 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
RicherMans / UIT_Mobile
View on GitHub
Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"
☆24Mar 6, 2023Updated 3 years ago
chenchy / D3Net
View on GitHub
A pytorch implementation of D3Net.
☆11Aug 8, 2021Updated 4 years ago
cschaefer26 / StyleMelGAN
View on GitHub
☆10Apr 8, 2024Updated 2 years ago
mmcauliffe / Pyraat
View on GitHub
Interface for running Praat scripts through Python
☆17May 16, 2025Updated last year
mt-upc / ZeroSwot
View on GitHub
Pushing the Limits of Zero-shot End-to-End Speech Translation
☆25Dec 12, 2024Updated last year
anton-kashkin / hifi_vc
View on GitHub
☆25Jan 24, 2023Updated 3 years ago
rgzn-aiyun / melgan-cpu
View on GitHub
Real-time melgan based on cpu ！！！
☆13Dec 3, 2019Updated 6 years ago
yoongi43 / VRVQ
View on GitHub
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Apr 10, 2025Updated last year
Rongjiehuang / Multi-Singer
View on GitHub
PyTorch Implementation of Multi-Singer (ACM-MM'21)
☆139May 8, 2022Updated 4 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
MingjieChen / DYGANVC
View on GitHub
demo page https://MingjieChen.github.io/dygan-vc
☆66Apr 13, 2022Updated 4 years ago
rgzn-aiyun / tacotron2-melgan
View on GitHub
Mel spectrum based on tacotron2 for melgan speech synthesis
☆15Mar 24, 2023Updated 3 years ago
cnlinxi / style-token_tacotron2
View on GitHub
style token with tacotron2
☆62Jul 6, 2023Updated 3 years ago
Zain-Jiang / Dict-TTS
View on GitHub
☆136Feb 4, 2023Updated 3 years ago
sarulab-speech / multi-speaker-dgp
View on GitHub
Official implementation of DGP-based multi-speaker speech synthesis with PyTorch
☆24Mar 23, 2021Updated 5 years ago
qqueing / pytorch-G2P
View on GitHub
(semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean
☆23Dec 17, 2017Updated 8 years ago
Rongjiehuang / awesome-speech-to-speech-translation
View on GitHub
List of direct speech-to-speech translation papers.
☆39Jan 31, 2023Updated 3 years ago