JarodMica / GPT-SoVITS-Package
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
☆18 Updated 3 months ago
Alternatives and similar repositories for GPT-SoVITS-Package:
Users who are interested in GPT-SoVITS-Package are comparing it to the libraries listed below.
- Misc. tools/scripts that I made to use for tortoise ☆21 Updated 7 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine-tune or train a voice model. ☆32 Updated this week
- ☆39 Updated 10 months ago
- StyleTTS 2 Optimized Training Fork ☆26 Updated last month
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆60 Updated 2 weeks ago
- Advanced RVC Inference for quicker and effortless model downloads ☆47 Updated last week
- Collection of the best Applio plugins. ☆29 Updated 6 months ago
- Real-time end-to-end singing voice conversion ☆20 Updated 4 months ago
- A Frontier Japanese Speech Generation net ☆28 Updated 2 weeks ago
- Performs the entire AI cover generation process with UI ☆17 Updated this week
- YuE with mp3 extend, exllama and GUI ☆40 Updated last month
- Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers. ☆67 Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆52 Updated 4 months ago
- A fast MP3 decoder for Python, using minimp3 ☆28 Updated 2 years ago
- Heteronym to Phoneme Parser ☆18 Updated last year
- An unofficial PyTorch implementation of VALL-E ☆87 Updated this week
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,… ☆68 Updated 5 months ago
- liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project ☆33 Updated last year
- ☆16 Updated 8 months ago
- ☆58 Updated this week
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback… ☆10 Updated 5 months ago
- RVC Onnx Infer - Upgraded and simplified-ish ☆21 Updated 10 months ago
- [SOTA] [92% acc] 786M-8k-44L-32H multi-instrumental music transformer with true full MIDI instruments range, efficient encoding, octo-vel… ☆82 Updated 3 months ago
- ☆27 Updated last year
- Text-to-Music Generation with Rectified Flow Transformer ☆60 Updated 6 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆42 Updated 3 months ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion ☆10 Updated 6 months ago
- ☆25 Updated 4 months ago
- Text prompt steered synthetic audio generators ☆46 Updated last year
- TTS pipeline that uses RVC to enhance audio quality and cloning ☆144 Updated last year