AnyaCoder / fish-speechLinks
Brand new TTS solution
☆11Updated last year
Alternatives and similar repositories for fish-speech
Users that are interested in fish-speech are comparing it to the libraries listed below
Sorting:
- RTVC: Real-Time Voice Conversion GUI☆60Updated 2 years ago
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆94Updated 3 months ago
- Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.☆55Updated 2 months ago
- ☆155Updated last year
- ☆128Updated this week
- A collection of all our phonemeizers for dataset construction and inference☆27Updated 11 months ago
- ☆67Updated 6 months ago
- Vocal Remover using Deep Neural Networks☆19Updated last year
- 数据集自动化制作脚本☆72Updated 2 years ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Updated last year
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆50Updated last week
- Sovits5 with RMVPE☆14Updated 2 years ago
- StyleTTS 2 Optimized Training Fork☆33Updated last year
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆108Updated 4 months ago
- ☆13Updated 3 weeks ago
- A collection of optimized utilities for text-to-audio processing, enhancing both training and inference workflows. This repository contai…☆44Updated 10 months ago
- A collection of neural vocoders suitable for singing voice synthesis tasks.☆149Updated this week
- 🌻 VITS ONNX TTS server designed for fast inference 🔥☆130Updated last year
- Googleの音声復元モデルMiipher-2の再現実装の学習および推論コード。学習済みモデルも公開しています。☆30Updated 6 months ago
- DiffSinger dataset processing tools, including audio processing, labeling.☆69Updated 2 weeks ago
- SOFA_AI: Singing-Oriented Forced Aligner for Automatic Inference☆24Updated last year
- F5-TTS 推理加速,速度提升约4倍!☆122Updated last year
- The official code repository for SongPrep: A Preprocessing Framework and End-to-end Model for Full-song Structure Parsing and Lyrics Tran…☆149Updated 2 months ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆81Updated last year
- Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS☆27Updated 3 weeks ago
- 大量の音声データから笑い声部分を集めるやつ☆12Updated last year
- ONNX and TensorRT implementation of Whisper☆66Updated 2 years ago
- General Prior for Anime - 1☆44Updated 2 years ago
- VI-SVC model is just VITS without MAS and DurationPredictor.☆10Updated 2 years ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆62Updated 2 years ago