JoungheeKim/Non-Attentive-Tacotron

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/JoungheeKim/Non-Attentive-Tacotron)

JoungheeKim / Non-Attentive-Tacotron

This is Pytorch Implementation of Google's Non-attentive Tacotron.

☆57

Alternatives and similar repositories for Non-Attentive-Tacotron

Users that are interested in Non-Attentive-Tacotron are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

BridgetteSong / ExpressiveTacotron
View on GitHub
This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…
☆74Sep 21, 2022Updated 3 years ago
hcy71o / AutoVocoder
View on GitHub
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
☆71Dec 2, 2022Updated 3 years ago
keonlee9420 / Parallel-Tacotron2
View on GitHub
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
☆191Nov 18, 2021Updated 4 years ago
rishikksh20 / iSTFT-Avocodo-pytorch
View on GitHub
Ultrafast GAN based Vocoder for Text to Speech
☆50Jul 16, 2022Updated 4 years ago
cnaigithub / Auto_Tuning_Zeroshot_TTS_and_VC
View on GitHub
Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…
☆80May 29, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
bshall / acoustic-model
View on GitHub
Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
☆104Mar 10, 2026Updated 4 months ago
yl4579 / PL-BERT
View on GitHub
Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
☆268Jan 13, 2025Updated last year
PlayVoice / BigVGAN
View on GitHub
BigVGAN with Neural Source-Filter
☆58Sep 21, 2023Updated 2 years ago
xcmyz / CLONE
View on GitHub
☆20Jul 13, 2022Updated 4 years ago
idiap / zff_vad
View on GitHub
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
☆23Oct 19, 2023Updated 2 years ago
choiHkk / nix-tts
View on GitHub
End-To-End SpeechSynthesis system with knowledge distillation
☆18Jul 16, 2022Updated 4 years ago
zzw922cn / wesinger2
View on GitHub
Synthesized singing voice demos of WeSinger 2 paper.
☆26Feb 20, 2023Updated 3 years ago
YangAi520 / NSPP
View on GitHub
☆55Mar 2, 2023Updated 3 years ago
rishikksh20 / VocGAN
View on GitHub
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
☆321Jul 25, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
yuan1615 / AdaVocoder
View on GitHub
Adaptive Vocoder for Custom Voice
☆61Sep 22, 2022Updated 3 years ago
ishine / PnG-BERT
View on GitHub
PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS
☆24Jan 29, 2022Updated 4 years ago
rishikksh20 / vae_tacotron2
View on GitHub
VAE Tacotron 2, an alternative of GST Tacotron
☆91Jul 6, 2023Updated 3 years ago
rishikksh20 / NaturalSpeech2
View on GitHub
☆69May 19, 2023Updated 3 years ago
rishikksh20 / Fre-GAN-pytorch
View on GitHub
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
☆112Aug 26, 2021Updated 4 years ago
hhguo / MSMC-TTS
View on GitHub
Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS
☆168Apr 10, 2024Updated 2 years ago
keonlee9420 / Daft-Exprt
View on GitHub
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
☆55Oct 15, 2021Updated 4 years ago
YoungSeng / SRD-VC
View on GitHub
Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)
☆119Feb 7, 2024Updated 2 years ago
Daisyqk / Automatic-Prosody-Annotation
View on GitHub
☆112Mar 9, 2026Updated 4 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
ubisoft / ubisoft-laforge-daft-exprt
View on GitHub
Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
☆127Apr 8, 2023Updated 3 years ago
LeoniusChen / Attentions-in-Tacotron
View on GitHub
☆69Mar 31, 2021Updated 5 years ago
yerfor / SyntaSpeech
View on GitHub
SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code
☆201Sep 4, 2022Updated 3 years ago
webaverse / LJSpeechTools
View on GitHub
Tools to isolate speaker and transcribe unstructured audio clips
☆11Dec 4, 2022Updated 3 years ago
sarulab-speech / multi-speaker-dgp
View on GitHub
Official implementation of DGP-based multi-speaker speech synthesis with PyTorch
☆24Mar 23, 2021Updated 5 years ago
rishikksh20 / SoundStorm-pytorch
View on GitHub
Google's SoundStorm: Efficient Parallel Audio Generation
☆131Aug 8, 2023Updated 2 years ago
Labmem-Zhouyx / CDFSE_FastSpeech2
View on GitHub
The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…
☆86Dec 20, 2022Updated 3 years ago
shang0712 / HierTTS
View on GitHub
☆47Apr 16, 2023Updated 3 years ago
descriptinc / cargan
View on GitHub
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
☆193Dec 8, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Joshua-1995 / LearnableUpsamplingLayer-Pytorch
View on GitHub
Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)
☆57Mar 12, 2024Updated 2 years ago
keonlee9420 / Comprehensive-E2E-TTS
View on GitHub
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…
☆147Jun 6, 2022Updated 4 years ago
iisys-hof / HUI-Audio-Corpus-German
View on GitHub
This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…
☆35Mar 31, 2023Updated 3 years ago
keonlee9420 / Comprehensive-Transformer-TTS
View on GitHub
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration…
☆328Sep 24, 2022Updated 3 years ago
hcy71o / SNAC
View on GitHub
Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…
☆57Aug 7, 2023Updated 2 years ago
scutcsq / Neural-Transducers-for-Two-Stage-Text-to-Speech-via-Semantic-Token-Prediction
View on GitHub
Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…
☆60Apr 4, 2024Updated 2 years ago
yl4579 / StyleTTS
View on GitHub
Official Implementation of StyleTTS
☆466Jan 13, 2025Updated last year