qiye45 / Bert-VITS2_easy_training

Simplifies Bert-VITS2 model training

☆9

Alternatives and similar repositories for Bert-VITS2_easy_training

Users who are interested in Bert-VITS2_easy_training are comparing it to the repositories listed below.

ex3ndr / supervoice-separate
Supervoice Speaker Separation Network
☆12 Updated last year
AI-S2-Lab / EmoPP
[NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech
☆22 Updated 11 months ago
40740 / Bert-VITS2-2
☆13 Updated last year
lovemefan / SenseVoice-python
SenseVoice-python: An enterprise-grade open-source multi-language ASR system from FunASR, with onnxruntime
☆96 Updated 9 months ago
SLPcourse / Singing-Voice-Conversion
Project of Singing Voice Conversion.
☆15 Updated last year
ishine / aishell_annotation
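AISHELL open-source data annotation platform, including speech and image annotation, data quality inspection, acceptance, statistics, and other features.
☆24 Updated 5 years ago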
lucasjinreal / textfrontend
A separately maintained Chinese TTS
☆35 Updated 2 years ago
pengzhendong / speaker-diarization
Offline Speaker Diarization with SenseVoice by Sherpa ONNX.
☆13 Updated 6 months ago
jdh-algo / JoyTTS
☆21 Updated this week
cronrpc / Audio-Speaker-Needle-In-Haystack
Finding the most similar timbre (tone color) in a large collection of audio.
☆13 Updated last year
SonyResearch / diffvox
Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"
☆28 Updated this week
yeyupiaoling / YeAudio
Audio tools for Python
☆15 Updated 8 months ago
simplespeech / simplespeechDemo
☆8 Updated 11 months ago
Tikai7 / DiTTO-TTS
DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors
☆28 Updated 5 months ago
pengzhendong / audio-pipeline
☆22 Updated 9 months ago
ex3ndr / supervoice-vocoder
Production-ready vocoder using BigVSAN
☆11 Updated last year
lucasjinreal / aural
A tiny project for ASR model training and deployment
☆27 Updated 2 years ago
nobody996 / FastSVC
Audio Demo for "FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation"
☆20 Updated 4 years ago
Sg4Dylan / libvits-ncnn
libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻
☆62 Updated 2 years ago
ryanrudes / YTTTS
The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions
☆51 Updated 4 years ago
ictnlp / ComSpeech
Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".
☆24 Updated last year
ex3ndr / supervoice-hybrid
My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless, and TortoiseTTS into one
☆26 Updated 11 months ago
Plachtaa / ASTRAL-quantization
Speaker-disentangled speech linguistic content quantizer
☆21 Updated 4 months ago
uthree / ddsp-vocoder
☆10 Updated 8 months ago
Infinity-INF / fast-phasr
Phoneme and duration labeling based on Whisper small
☆11 Updated last year
madhavlab / wav2tok
Codebase for the ICLR '23 paper "wav2tok: Deep Sequence Tokenizer for Audio Retrieval"
☆36 Updated last year
yoongi43 / VRVQ
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11 Updated 3 months ago
xinliu9451 / awesome-denoiser
This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …
☆40 Updated 7 months ago
ishine / vc-lm
Converts any person's voice timbre into thousands of different timbres
☆30 Updated 2 years ago
ShoukanLabs / VoPho
A collection of all our phonemizers for dataset construction and inference
☆24 Updated 5 months ago