Yoshifumi-Nakano/visual-text-to-speech

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Yoshifumi-Nakano/visual-text-to-speech)

Yoshifumi-Nakano / visual-text-to-speech

visual-text to speech

☆14

Alternatives and similar repositories for visual-text-to-speech

Users that are interested in visual-text-to-speech are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tctsigemura / Exam
View on GitHub
Examination Questions in the Dept. of Computer Science and Electronic Engineering.
☆11May 26, 2026Updated 2 months ago
KeisukeImoto / RWCPSSD_Onomatopoeia
View on GitHub
RWCP-SSD-Onomatopoeia
☆24Jun 28, 2023Updated 3 years ago
hs-oh-prml / EmotionControllableTextToSpeech
View on GitHub
☆21Jun 16, 2021Updated 5 years ago
nii-yamagishilab / speaker_sex_attribute_privacy
View on GitHub
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15Nov 30, 2022Updated 3 years ago
revsic / torch-retriever-vc
View on GitHub
PyTorch implementation of Retriever: Learning Content-Style Representation
☆12Jan 27, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
marytts / pavoque-data
View on GitHub
PAVOQUE Corpus of Expressive Speech
☆12Aug 2, 2016Updated 9 years ago
line / WaveTrainerFit
View on GitHub
Official implementation of "Wave-Trainer-Fit: Neural Vocoder with Trainable Prior and Fixed-Point Iteration towards High-Quality Speech G…
☆16Feb 6, 2026Updated 5 months ago
yamachu / julius4seg
View on GitHub
Juliusを使ったセグメンテーション支援ツール
☆14Feb 13, 2020Updated 6 years ago
patrickltobing / shallow-wavenet
View on GitHub
☆18Feb 9, 2020Updated 6 years ago
kohei0209 / self-remixing
View on GitHub
Official implementation of Self-Remixing
☆18Feb 3, 2024Updated 2 years ago
Takaaki-Saeki / ssl_speech_restoration
View on GitHub
SelfRemaster: SSL Speech Restoration
☆94Jan 5, 2024Updated 2 years ago
JeremyCCHsu / vc-vawgan
View on GitHub
Network specification and demo
☆35Jun 5, 2017Updated 9 years ago
zerospeech / zerospeech2020
View on GitHub
Python package for the Zero Speech Challenge 2020
☆14Feb 5, 2021Updated 5 years ago
BiSinger-SVS / BiSinger
View on GitHub
Bilingual Singing Voice Synthesis
☆18Mar 25, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
karchkha / MelSpec_GPT_VQVAE
View on GitHub
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
☆18Oct 8, 2023Updated 2 years ago
light1726 / BetaVAE_VC
View on GitHub
Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using ß-VAE"
☆43Apr 10, 2023Updated 3 years ago
auspicious3000 / SpeechSplit-Demo
View on GitHub
Unsupervised Speech Decomposition via Triple Information Bottleneck
☆14Apr 29, 2020Updated 6 years ago
ZackHodari / discrete_intonation
View on GitHub
Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt…
☆17May 24, 2020Updated 6 years ago
qCanoe / Google-Scholar-Bibtex-Copy
View on GitHub
☆14Nov 13, 2025Updated 8 months ago
ttslr / StrengthNet
View on GitHub
[INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning
☆83Nov 4, 2022Updated 3 years ago
jhrcook / bayesian-data-analysis-course
View on GitHub
My notes and work for the Bayesian Data Analysis course taught by Aki Vehtari.
☆18Oct 29, 2022Updated 3 years ago
XinyuZhou2000 / Spoken-Dialogue
View on GitHub
☆18Dec 7, 2023Updated 2 years ago
Wendison / FCL-taco2
View on GitHub
Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021
☆41Jul 17, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
DeNA / Face2Speech
View on GitHub
☆20Mar 16, 2020Updated 6 years ago
thuhcsi / icassp2021-emotion-tts
View on GitHub
Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/
☆34Mar 17, 2023Updated 3 years ago
KrishnaDN / BERTphone
View on GitHub
Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"
☆17Dec 10, 2020Updated 5 years ago
jefflai108 / Unsupervised-TTS
View on GitHub
☆42Mar 25, 2022Updated 4 years ago
cnlinxi / tpse_tacotron2
View on GitHub
TPSE-GST Tacotron2
☆14May 1, 2019Updated 7 years ago
xinshengwang / ICASSP2021_paper_list-VC
View on GitHub
ICASSP 2021 accepted papers in term of voice conversion (VC)
☆18Apr 11, 2021Updated 5 years ago
Jarviswx / tonghuashun_text_matching
View on GitHub
同花顺算法挑战平台：【9-10双月赛】跨领域迁移的文本语义匹配
☆11Oct 28, 2021Updated 4 years ago
taketakeseijin / HarmonicLowering
View on GitHub
Implementation of Harmonic Convolution by Harmonic Lowering
☆17Nov 11, 2020Updated 5 years ago
stappit / bayesian-data-analysis
View on GitHub
Berlin Bayesians' solutions to Bayesian Data Analysis, 3rd edition.
☆21Sep 21, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
sarulab-speech / xvector_jtubespeech
View on GitHub
xvector model on jtubespeech
☆47Nov 5, 2023Updated 2 years ago
DwangoMediaVillage / pydomino
View on GitHub
日本語音声に対して音素ラベルをアラインメントするためのツールです
☆40Aug 19, 2025Updated 11 months ago
ranxi2001 / zero2Leetcode
View on GitHub
从零基础 Python 到企业笔试机试的系统性LeetCode100刷题指南-配置免费AI刷题助手
☆22Updated this week
bastibe / MAPS-Scripts
View on GitHub
A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.
☆25Mar 29, 2021Updated 5 years ago
roedoejet / FastSpeech2
View on GitHub
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
☆22Jul 5, 2023Updated 3 years ago
freds0 / CML-TTS-Dataset
View on GitHub
CML-TTS: A Multilingual Dataset for Speech Synthesis
☆36Jul 31, 2024Updated last year
hhguo / MSMC-TTS
View on GitHub
Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS
☆168Apr 10, 2024Updated 2 years ago