fishaudio / fish-audio-python
☆33Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for fish-audio-python
- ☆76Updated last week
- Running the F5-TTS by ONNX Runtime☆27Updated last week
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆73Updated last month
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆24Updated last week
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆50Updated 3 years ago
- A lightweight end-to-end text-to-speech model☆91Updated last month
- singing voice conversion based on glow-tts☆11Updated last year
- We Speech Transcript based on LLM, in 300 lines of code.☆126Updated 2 months ago
- GPT-style network for phonemization with durations of text☆62Updated 7 months ago
- a text-conditional diffusion probabilistic model capable of generating high fidelity audio.☆124Updated 5 months ago
- 基于vits fastspeech2 visinger的tts模型☆23Updated last year
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆66Updated 7 months ago
- Brand new TTS solution☆8Updated last week
- flow mirror models from JZX AI Labs☆40Updated last month
- ✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and langua☆36Updated last week
- VC Without Retrain!☆102Updated 6 months ago
- ☆12Updated last year
- 单独维护的中文TTS☆35Updated 2 years ago
- 基于达摩院视频切割技术的视频转换为短音频的vits数据集生成工具 A VITS Dataset Generation Tool for Converting Video to Short Audio Based on Damo Academy Video Cutting T…☆52Updated 9 months ago
- Supervoice Speaker Separation Network☆13Updated 5 months ago
- Identify speakers with stable voice timbre.☆26Updated 4 months ago
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆31Updated this week
- RTVC: Real-Time Voice Conversion GUI☆51Updated last year
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆36Updated last year
- Finding the most similar tone/color in a large collection of audio. 在一大堆音频中寻找最相似的音色。☆12Updated 4 months ago
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.☆32Updated last year
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆61Updated last year
- Official Code for ParrotTTS☆42Updated 3 weeks ago