egorsmkv / radtts-hifiganLinks
RADTTS + HiFiGAN vocoder
☆9Updated 2 years ago
Alternatives and similar repositories for radtts-hifigan
Users that are interested in radtts-hifigan are comparing it to the libraries listed below
Sorting:
- Tool to make high quality text to speech (tts) corpus from audio + text books.☆23Updated 2 months ago
- Use quantized versions of Whisper to speed up inference☆12Updated 8 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- Toolbox for easy and qualitative one-shot voice conversion☆45Updated 3 years ago
- Online streaming speaker change detection model in Pytorch☆40Updated 2 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- Normalize Text in Russian☆27Updated last year
- ☆31Updated 2 years ago
- NSNet2 Deep Noise Suppression (DNS) package☆36Updated 2 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆52Updated last month
- Voice Conversion method based on speaker style☆14Updated 3 years ago
- ☆15Updated 2 months ago
- ☆26Updated last month
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Updated 3 years ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆15Updated last year
- ☆28Updated 4 months ago
- ☆73Updated last week
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆43Updated 3 years ago
- Your one-stop solution for voice dataset creation☆119Updated last year
- ☆29Updated 3 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆31Updated 2 years ago
- This is the M-AILABS Speech Dataset☆68Updated 7 months ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆77Updated last year
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated 2 years ago
- Add n-gram and large language model (LLM) support to Whisper models.☆26Updated last month
- Unofficial implementation of wavenext vocoder☆47Updated 10 months ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆50Updated last year
- ☆63Updated last year
- Create training data for training a voice cloner for bark text to speech.☆45Updated 2 years ago