kan-bayashi/Taco2withBERT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kan-bayashi/Taco2withBERT)

kan-bayashi / Taco2withBERT

Tacotron2 with BERT examples

☆10

Alternatives and similar repositories for Taco2withBERT

Users that are interested in Taco2withBERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Yolanda-Gao / VoiceGANmodel
View on GitHub
☆19Feb 28, 2018Updated 8 years ago
dhgrs / pytorch-UniWaveNet
View on GitHub
☆31Nov 7, 2018Updated 7 years ago
rishikksh20 / PPSpeech
View on GitHub
PPSpeech: Phrase based Parallel End-to-End TTS System
☆35Aug 31, 2020Updated 5 years ago
MU94W / TTS-Eval
View on GitHub
☆18Aug 9, 2018Updated 7 years ago
npuichigo / extract_features_using_world
View on GitHub
using world vocoder to extract features and make data for training neural networks
☆11Oct 9, 2017Updated 8 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
shaojinding / Adversarial-Many-to-Many-VC
View on GitHub
[InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …
☆39Mar 24, 2023Updated 3 years ago
cjerry1243 / TransferLearning-CLVC
View on GitHub
Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion
☆40Oct 22, 2022Updated 3 years ago
gonglinyuan / metro_t0
View on GitHub
Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023)
☆22Nov 1, 2023Updated 2 years ago
bigpon / SpeechSubjectiveTest
View on GitHub
Speech (audio) subjective evaluation system
☆42Jul 15, 2020Updated 6 years ago
tqbl / dcase2018_task2
View on GitHub
Surrey CVSSP DCASE 2018 Task 2 system
☆20Dec 26, 2022Updated 3 years ago
liusongxiang / efficient_tts
View on GitHub
Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"
☆116Dec 22, 2021Updated 4 years ago
ajinkyakulkarni14 / ERISHA
View on GitHub
ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…
☆44Dec 17, 2020Updated 5 years ago
toni-heittola / dcase2019_task1_baseline
View on GitHub
DCASE2019 Challenge Task 1 baseline system
☆20Oct 11, 2019Updated 6 years ago
IU-SAIGE / pse
View on GitHub
Efficient Personalized Speech Enhancement through Self-Supervised Learning
☆23Mar 12, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ex3ndr / supervoice-hybrid
View on GitHub
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Aug 5, 2024Updated last year
mutiann / neural-lexicon-reader
View on GitHub
Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge
☆21Jul 25, 2022Updated 4 years ago
ljuvela / ResGAN
View on GitHub
Core code for my ICASSP 2018 paper
☆53Jul 27, 2018Updated 8 years ago
unilight / cdvae-vc
View on GitHub
TensorFlow Implementation of CDVAE-VC.
☆54Mar 24, 2023Updated 3 years ago
keunwoochoi / UrbanSound8K-preprocessing
View on GitHub
☆11Mar 15, 2017Updated 9 years ago
all-the-noises / eval-arena
View on GitHub
☆34Mar 21, 2026Updated 4 months ago
NeuroWave-ai / CUCVAE-TTS
View on GitHub
☆25Mar 12, 2022Updated 4 years ago
r9y9 / MelGeneralizedCepstrums.jl
View on GitHub
Mel-Generalized Cepstrum analysis
☆19Jul 21, 2017Updated 9 years ago
kan-bayashi / WaveNetVocoderSamples
View on GitHub
WaveNet Vocoder Samples
☆23Aug 23, 2019Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ivanvovk / compressed-tacotron2-pytorch
View on GitHub
Compressed version of Tacotron 2 using Tensor Train + Waveglow.
☆22Dec 26, 2019Updated 6 years ago
NEUIR / INTERVENOR
View on GitHub
[ACL '24] Source code for paper: INTERVENOR : Prompt the Coding Ability of Large Language Models with the Interactive Chain of Repairing
☆30Nov 25, 2024Updated last year
thuhcsi / FlatTN
View on GitHub
Chinese Text Normalization and Dataset
☆91May 14, 2022Updated 4 years ago
Yolanda-Gao / VoiceGAN
View on GitHub
These are the results for VoiceGAN voice transformation. You can hear the audios which are in folder A-AB-ABA/B-BA-BAB
☆50Apr 9, 2019Updated 7 years ago
karthikbhamidipati / multi-task-speech-classification
View on GitHub
Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset
☆28Jul 17, 2026Updated last week
ronggong / DCASE2017-task1
View on GitHub
Homemade LightGBM and VGG-net experiment setup for DCASE2017 task 1
☆11Aug 8, 2017Updated 8 years ago
Takaaki-Saeki / zm-text-tts
View on GitHub
[IJCAI'23] Learning to Speak from Text for Low-Resource TTS
☆65May 30, 2023Updated 3 years ago
vara-tts / VARA-TTS
View on GitHub
Demo audio of VARA-TTS model
☆20Jun 11, 2021Updated 5 years ago
npuichigo / grpc_gateway_demo
View on GitHub
Audio streaming transfer demo with google.api.HttpBody and grpc gateway for speech synthesis
☆20Jan 28, 2020Updated 6 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
deepakacharyab / gnn_feature_selection_extraction
View on GitHub
☆15Oct 23, 2019Updated 6 years ago
crabl / HeadSpace
View on GitHub
An DSP library written in Python for performing HRTFs
☆21Aug 15, 2016Updated 9 years ago
itsuki8914 / Voice-morphing-RelGAN
View on GitHub
A implementation voice morphing using relgan with tensorflow
☆25Mar 24, 2023Updated 3 years ago
austinmoehle / wavernn
View on GitHub
WaveRNN-based waveform generator & demo of TensorFlow CuDNN-GRU usage.
☆24Aug 19, 2018Updated 7 years ago
richardbaihe / a3t
View on GitHub
Code for paper A3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing
☆89Sep 6, 2024Updated last year
nii-yamagishilab / self-attention-tacotron
View on GitHub
An implementation of "Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language" …
☆114Jun 19, 2020Updated 6 years ago
zhengyang5 / MMED400
View on GitHub
☆13Nov 19, 2024Updated last year