AnyaCoder / fish-speechLinks
Brand new TTS solution
☆10Updated 5 months ago
Alternatives and similar repositories for fish-speech
Users that are interested in fish-speech are comparing it to the libraries listed below
Sorting:
- ☆12Updated 2 years ago
- StyleTTS 2 Optimized Training Fork☆29Updated 4 months ago
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆75Updated last week
- 基于vits fastspeech2 visinger的tts模型☆23Updated 2 years ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆60Updated 2 years ago
- Project of Singing Voice Conversion.☆14Updated last year
- A collection of all our phonemeizers for dataset construction and inference☆23Updated 3 months ago
- ☆29Updated last year
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆37Updated 6 months ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆24Updated last year
- Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.☆47Updated 2 months ago
- Cantonese Text to Speech with VITS implementation☆30Updated 2 years ago
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Updated 9 months ago
- UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)☆12Updated last year
- Non Parallel Voice Conversion based on VITS☆24Updated 2 years ago
- RTVC: Real-Time Voice Conversion GUI☆55Updated last year
- Real-time end-to-end singing voice convertion☆22Updated 7 months ago
- 重构GPT-SOVITS的项目,重写了部分代码,优化了webui的使用以及增加了api调用☆27Updated 5 months ago
- ☆138Updated 3 months ago
- VITS with phoneme-level prosody modeling based on MaskGIT☆81Updated 9 months ago
- Putting flows on top of neural transducers for better TTS☆62Updated last week
- A collection of neural vocoders suitable for singing voice synthesis tasks.☆124Updated 2 months ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆17Updated last year
- speaker-disentangled speech linguistic content quantizer☆16Updated 2 months ago
- ☆51Updated 2 weeks ago
- Multispeaker Community Vocoder Model for DiffSinger☆37Updated last month
- (WIP) A retrain of F5-TTS on permissively-licensed data☆11Updated last month
- Vocal Remover using Deep Neural Networks☆17Updated 5 months ago
- ☆67Updated last year
- ☆108Updated this week