5Hyeons / StyleTTS2-VocosLinks
StyleTTS2 + Vocos as a Decoder
☆13Updated 7 months ago
Alternatives and similar repositories for StyleTTS2-Vocos
Users that are interested in StyleTTS2-Vocos are comparing it to the libraries listed below
Sorting:
- StyleTTS 2 Optimized Training Fork☆34Updated 9 months ago
- ☆14Updated last year
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆17Updated last year
- High quality text-to-speech based on StyleTTS 2.☆70Updated last week
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆16Updated last year
- ☆70Updated last year
- Training code and dataset cleasing with Sidon☆42Updated last week
- ☆28Updated 2 years ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆27Updated last year
- A collection of all our phonemeizers for dataset construction and inference☆27Updated 8 months ago
- Unofficial implementation of wavenext vocoder☆52Updated last year
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Updated 2 years ago
- ☆25Updated last year
- ☆49Updated 4 months ago
- DiFlow-TTS delivers low-latency zero-shot TTS via discrete flow matching and factorized speech tokens. A compact, open framework for fast…☆46Updated last week
- My vocoder experiments☆31Updated 3 months ago
- speaker-disentangled speech linguistic content quantizer☆23Updated 7 months ago
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆35Updated 6 months ago
- pytorch model for contexless-phoneme prediction from speech audio☆29Updated 2 weeks ago
- ☆44Updated 4 months ago
- This repository implement a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in …☆56Updated 3 months ago
- ☆43Updated last year
- ☆19Updated last year
- Official PyTorch implementation of "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis…☆16Updated 8 months ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- Official repository of Wavehax vocoder☆56Updated 3 months ago
- Collection of scripts from mHuBERT-147.☆32Updated 11 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 6 months ago
- Supervoice diffusion enhance☆27Updated last year
- ☆19Updated last year