sungwanha / CsWatchLinks
☆11Updated last year
Alternatives and similar repositories for CsWatch
Users that are interested in CsWatch are comparing it to the libraries listed below
Sorting:
- c# project☆10Updated 6 months ago
- ☆10Updated last year
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E☆139Updated 7 months ago
- ☆77Updated 4 months ago
- ☆146Updated 8 months ago
- ☆16Updated last year
- [INTERSPEECH 2024] The official implementation of EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for …☆146Updated 2 weeks ago
- The open source code for SimpleSpeech series☆138Updated 7 months ago
- Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training☆134Updated 2 years ago
- ☆29Updated last year
- ☆66Updated last year
- Evaluation Protocol for Large-Scale Zero-Shot TTS Literature☆80Updated 2 months ago
- An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"☆135Updated last year
- perturbation_autovc☆18Updated last year
- End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions☆92Updated last year
- ☆11Updated last month
- FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3☆200Updated last year
- Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Pr…☆221Updated 11 months ago
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter☆91Updated 11 months ago
- Reference-aware automatic speech evaluation toolkit☆153Updated 6 months ago
- ☆117Updated 2 years ago
- [TAFFC 2025] The official implementation of EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vec…☆93Updated last month
- Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)☆152Updated 2 years ago
- [AAAI 2024] Code for CTX-vec2wav in UniCATS☆129Updated 11 months ago
- LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning☆141Updated 11 months ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆88Updated 3 years ago
- ☆137Updated last month
- UT-Sarulab MOS prediction system using SSL models☆238Updated last year
- JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆110Updated 3 years ago
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion☆145Updated last year