34j / awesome-vits
List of repositories relevant to VITS.
☆35Updated 2 years ago
Alternatives and similar repositories for awesome-vits
Users that are interested in awesome-vits are comparing it to the libraries listed below
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.☆35Updated 2 years ago
- ☆29Updated last year
- Non Parallel Voice Conversion based on VITS☆24Updated 2 years ago
- A high-resolution implementation of the HiFi-GAN vocoder for voice conversion.☆31Updated 2 years ago
- AudioSR-Upsampling (any -> 48kHz)☆41Updated last year
- Official implementation for FlowSep☆49Updated 5 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated 2 years ago
- Singing Voice Synthesis based on VITS, different from VISinger☆190Updated last year
- Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.☆47Updated 2 months ago
- BigVGAN with Neural Source-Filter☆55Updated last year
- Finetuning VITS Efficiently☆33Updated last year
- Putting flows on top of neural transducers for better TTS☆62Updated last week
- A lightweight voice conversion model☆82Updated 9 months ago
- Speaker embedding for VI-SVC and VI-SVS, also for VITS; use this to replace the ID to implement voice cloning.☆29Updated 2 years ago
- ☆56Updated 11 months ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆66Updated 3 weeks ago
- ☆24Updated 3 weeks ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆119Updated 2 years ago
- Monotonic Alignment Search☆91Updated 2 years ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆34Updated 11 months ago
- Official Implementation of StyleTTS-VC☆180Updated 4 months ago
- A collection of all our phonemizers for dataset construction and inference☆23Updated 3 months ago
- PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions☆77Updated 7 months ago
- A collection of voice conversion research☆93Updated this week
- VoiceBank-2023 is a speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆39Updated last year
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆132Updated last year
- An unofficial PyTorch implementation of VALL-E☆87Updated last week
- Zero-Shot Emotion Style Transfer☆45Updated last month
- Adaptive Vocoder for Custom Voice☆59Updated 2 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆127Updated 2 years ago