daswer123 / xtts-finetune-tests
In this repository I will be running various experiments on finetune different parts for xtts
☆14Updated 10 months ago
Alternatives and similar repositories for xtts-finetune-tests:
Users that are interested in xtts-finetune-tests are comparing it to the libraries listed below
- StyleTTS 2 Optimized Training Fork☆27Updated 2 months ago
- Official Code for ParrotTTS☆48Updated 6 months ago
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆70Updated this week
- audiolm-pytorch training code☆15Updated last year
- finetune llm part for spark-tts model☆57Updated last month
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆69Updated 6 months ago
- ☆56Updated 10 months ago
- Official implementation of the TTS model Lina-Speech☆163Updated 3 months ago
- Open TTS models, built for streaming on the edge☆39Updated last month
- a Frontier Japanese Speech Generation net☆31Updated last month
- Audio tokenization, in the fastest way possible!☆51Updated 8 months ago
- ☆40Updated 2 months ago
- An unofficial PyTorch implementation of VALL-E☆87Updated last week
- Zero-Shot Emotion Style Transfer☆45Updated this week
- ☆13Updated last year
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆35Updated 4 months ago
- All generative model in one for better TTS model☆67Updated 7 months ago
- A collection of all our phonemeizers for dataset construction and inference☆22Updated 2 months ago
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆76Updated 5 months ago
- ☆26Updated 5 months ago
- ☆26Updated last year
- Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".☆24Updated 9 months ago
- Official Implementation of StyleTTS-VC☆177Updated 3 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆67Updated 5 months ago
- TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching☆48Updated last week
- F5-TTS 推理加速,速度提升约4倍!☆78Updated 3 months ago
- ☆50Updated 3 weeks ago
- ☆18Updated 11 months ago
- The official implementation of EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector (TAFFC 20…☆85Updated last week
- (WIP) A retrain of F5-TTS on permissively-licensed data☆11Updated 3 weeks ago