StyleTTS2 + Vocos as a Decoder
☆13 · Mar 24, 2025 · Updated 11 months ago
Alternatives and similar repositories for StyleTTS2-Vocos
Users that are interested in StyleTTS2-Vocos are comparing it to the libraries listed below
- StyleTTS 2 Optimized Training Fork ☆33 · Feb 2, 2025 · Updated last year
- High quality text-to-speech based on StyleTTS 2. ☆73 · Feb 25, 2026 · Updated last week
- chatterbox TTS + Voice Clone using onnx ☆27 · Dec 31, 2025 · Updated 2 months ago
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA ☆18 · Aug 16, 2024 · Updated last year
- ☆14 · Aug 19, 2024 · Updated last year
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++ ☆18 · Apr 17, 2024 · Updated last year
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model. ☆52 · May 22, 2025 · Updated 9 months ago
- ☆57 · Feb 8, 2026 · Updated 3 weeks ago
- KittenTTS is an ultra-lightweight, CPU-friendly text-to-speech model with 15M params for real-time, high-quality voices. Open source, fas… ☆23 · Updated this week
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆26 · Aug 5, 2024 · Updated last year
- Onset-and-Offset-Aware Sound Event Detection ☆21 · Feb 10, 2025 · Updated last year
- text to speech ☆10 · Mar 19, 2024 · Updated last year
- ☆11 · Mar 22, 2023 · Updated 2 years ago
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-… ☆16 · Feb 1, 2026 · Updated last month
- Openfst mirror with some fixes ☆14 · Aug 23, 2024 · Updated last year
- ☆14 · Aug 1, 2025 · Updated 7 months ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts ☆16 · Dec 3, 2024 · Updated last year
- Using OpenVINO to speed up MeloTTS inference ☆15 · Nov 1, 2024 · Updated last year
- ☆52 · Jun 24, 2025 · Updated 8 months ago
- DST is a Decoder-only simultaneous machine translation model, which can conduct policy decision and translation concurrently ☆11 · Jun 6, 2024 · Updated last year
- Tracking beer/wine using Audio Event Detection with Machine Learning ☆15 · Jun 16, 2024 · Updated last year
- Implementation of the paper: StyleBERT: Text-Audio Sentiment Analysis with Bi-directional Style Enhancement ☆14 · Apr 10, 2023 · Updated 2 years ago
- Sing any popular song with your voice ☆11 · Jul 10, 2022 · Updated 3 years ago
- DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis. ☆54 · Sep 25, 2023 · Updated 2 years ago
- Export the STFT or ISTFT process in ONNX format. ☆40 · Nov 21, 2025 · Updated 3 months ago
- A Study of Low-Resource Speech Commands Recognition Based on Adversarial Reprogramming ☆19 · Oct 12, 2023 · Updated 2 years ago
- Forced alignment decoder for Whisper. ☆14 · Mar 13, 2024 · Updated last year
- ☆58 · Jun 28, 2024 · Updated last year
- Onnx compatible styletts2 code ☆17 · Jun 8, 2025 · Updated 8 months ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices" ☆21 · Jun 7, 2025 · Updated 8 months ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀 ☆20 · May 20, 2025 · Updated 9 months ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks ☆17 · Aug 18, 2023 · Updated 2 years ago
- Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectr… ☆17 · Apr 27, 2023 · Updated 2 years ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ☆14 · Nov 15, 2025 · Updated 3 months ago
- ☆14 · Feb 9, 2023 · Updated 3 years ago
- (WIP) A retrain of F5-TTS on permissively-licensed data ☆13 · Apr 6, 2025 · Updated 10 months ago
- ☆21 · Mar 7, 2025 · Updated 11 months ago
- Megatts2 use HierSpeechpp's vocoder ☆18 · Dec 2, 2024 · Updated last year
- ☆16 · Dec 23, 2021 · Updated 4 years ago