amphionspace / tts-evaluationLinks

An evaluation set for large-scale trained TTS models (Coming in Sep 2024)

☆12

Alternatives and similar repositories for tts-evaluation

Users that are interested in tts-evaluation are comparing it to the libraries listed below

Sorting:

Mddct / simple-tts
（WIP）long form speech generatoins
☆31Updated 10 months ago
MuyangDu / T5Voice
T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …
☆28Updated 3 months ago
ryota-komatsu / speech_resynth
Speech Resynthesis and Language Modeling
☆27Updated 7 months ago
audiodemo / voice-conversion
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Updated 2 years ago
b-sigpro / sed-hsmm
Onset-and-Offset-Aware Sound Event Detection
☆20Updated last year
Mddct / transformer-vocos
☆36Updated 5 months ago
p1an-lin-jung / wv_tts
☆19Updated last year
huutuongtu / Lightvoc
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18Updated last year
shivammehta25 / BetterFastSpeech2
Just another FastSpeech 2 but cleaner code :)
☆29Updated last year
KdaiP / DC-Speech-VAE
5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs
☆57Updated 2 months ago
asuni / PitchSqueezer
A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation
☆36Updated 2 years ago
mutiann / neural-lexicon-reader
Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge
☆21Updated 3 years ago
meaningTeam / tidy-tunes
Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …
☆22Updated this week
Mddct / cosyvoice2-flow-optimized
faster inference
☆28Updated last year
pengzhendong / audio-pipeline
☆23Updated last year
ex3ndr / supervoice-hybrid
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Updated last year
pengzhendong / streaming-vocos
Streaming Vocos
☆29Updated 8 months ago
xingchensong / CosyVoice-ttsfrd
☆25Updated 7 months ago
ryuclc / CosyVoice2-GRPO
A simple implementation for improving CosyVoice2 by GRPO method
☆32Updated 3 months ago
JusperLee / Gull-Codec-Training
☆13Updated 10 months ago
liuhuang31 / HiFTNet-sr
HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz
☆24Updated 2 years ago
Chengyuann / AutoStyle-TTS
Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-…
☆17Updated last week
ogunlao / glowtts_stdp
Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor
☆18Updated 2 years ago
exercise-book-yq / FreeCodec
FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS
☆24Updated last year
xinshengwang / robpitch
A pitch detection model trained to be robust against noise and reverberation environments.
☆27Updated last year
rishikksh20 / MiniMax-TTS-pytorch
Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report
☆49Updated 5 months ago
Mddct / usm-tokenizer
semantic tokenizer for speech and music
☆21Updated 7 months ago
yuan1615 / AdaVocoder
Adaptive Vocoder for Custom Voice
☆61Updated 3 years ago
miccio-dk / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Updated 3 years ago
liuhuang31 / g2pw_once
G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…
☆14Updated 2 years ago