egorsmkv / radtts-hifiganLinks

RADTTS + HiFiGAN vocoder

☆9

Alternatives and similar repositories for radtts-hifigan

Users that are interested in radtts-hifigan are comparing it to the libraries listed below

Sorting:

patriotyk / narizaka
Tool to make high quality text to speech (tts) corpus from audio + text books.
☆23Updated 2 months ago
egorsmkv / optimized-whisper
Use quantized versions of Whisper to speed up inference
☆12Updated 8 months ago
egorsmkv / asr-corpus-creator
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
☆27Updated last year
SolomidHero / real-time-voice-conversion
Toolbox for easy and qualitative one-shot voice conversion
☆45Updated 3 years ago
alumae / online_speaker_change_detector
Online streaming speaker change detection model in Pytorch
☆40Updated 2 years ago
keonlee9420 / Comprehensive-Tacotron2
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…
☆48Updated last year
shigabeev / russian_tts_normalization
Normalize Text in Russian
☆27Updated last year
insunhwang89 / StyleVC
☆31Updated 2 years ago
NeonGeckoCom / nsnet2-denoiser
NSNet2 Deep Noise Suppression (DNS) package
☆36Updated 2 years ago
HHousen / speaker-change-detection
Speaker change detection using SincNet and an LSTM/Transformer
☆52Updated last month
zcf28 / StyleGAN-VC
Voice Conversion method based on speaker style
☆14Updated 3 years ago
ahmedshah1494 / speech_robust_bench
☆15Updated 2 months ago
just-ai / speechflow
☆26Updated last month
msalhab96 / MultiSpeech
pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper
☆21Updated 3 years ago
rendchevi / daisy-tts
🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition
☆15Updated last year
frankyoujian / Edge-Punct-Casing
☆28Updated 4 months ago
BUTSpeechFIT / TS-ASR-Whisper
☆73Updated last week
freds0 / data_augmentation_for_asr
A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.
☆43Updated 3 years ago
rioharper / VocalForge
Your one-stop solution for voice dataset creation
☆119Updated last year
kamepong / ConvS2S-VC
☆29Updated 3 years ago
iisys-hof / HUI-Audio-Corpus-German
This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…
☆31Updated 2 years ago
imdatceleste / m-ailabs-dataset
This is the M-AILABS Speech Dataset
☆68Updated 7 months ago
RF5 / transfusion-asr
Transcribing Speech with Multinomial Diffusion, training code and models.
☆77Updated last year
leohuang2013 / pyannote-audio_overlapped-speech-detection_cpp
C++ version of pyannote audio overlapped speech detection pipeline
☆13Updated last year
Takaaki-Saeki / zm-text-tts
[IJCAI'23] Learning to Speak from Text for Low-Resource TTS
☆63Updated 2 years ago
hitz-zentroa / whisper-lm
Add n-gram and large language model (LLM) support to Whisper models.
☆26Updated last month
wetdog / wavenext_pytorch
Unofficial implementation of wavenext vocoder
☆47Updated 10 months ago
DigitalPhonetics / VoicePAT
VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.
☆50Updated last year
aixplain / tts-qa
☆63Updated last year
gitmylo / bark-data-gen
Create training data for training a voice cloner for bark text to speech.
☆45Updated 2 years ago