fishaudio / fish-audio-python
☆36Updated last month
Related projects
Alternatives and complementary repositories for fish-audio-python
- SenseVoice-python: An enterprise-grade open-source multilingual ASR system from FunASR, with onnxruntime☆75Updated 2 months ago
- Running F5-TTS with ONNX Runtime☆41Updated this week
- A lightweight end-to-end text-to-speech model☆91Updated 2 months ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆50Updated 3 years ago
- We Speech Transcript based on LLM, in 300 lines of code.☆127Updated 3 months ago
- ☆77Updated 2 weeks ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆26Updated 3 weeks ago
- A text-conditional diffusion probabilistic model capable of generating high-fidelity audio.☆127Updated 5 months ago
- An API project for SenseVoice that outputs timestamped subtitles☆14Updated 3 weeks ago
- Paraformer web server built with Sanic☆19Updated last year
- An enterprise-grade Voice Activity Detector from ModelScope and FunASR.☆66Updated last year
- flow mirror models from JZX AI Labs☆40Updated last month
- VC Without Retrain!☆104Updated 6 months ago
- A VITS Dataset Generation Tool for Converting Video to Short Audio Based on Damo Academy Video Cutting T…☆52Updated 10 months ago
- Singing voice conversion based on Glow-TTS☆11Updated last year
- GPT-style network for phonemization with durations of text☆62Updated 8 months ago
- ☆12Updated 2 years ago
- A TTS model based on VITS, FastSpeech2, and VISinger☆23Updated last year
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆37Updated last year
- Brand new TTS solution☆8Updated this week
- ☆45Updated 4 months ago
- Official Code for ParrotTTS☆43Updated last month
- Improves the accuracy of pypinyin using g2pW☆78Updated last year
- RTVC: Real-Time Voice Conversion GUI☆51Updated last year
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆71Updated 7 months ago
- Developer-friendly inference code for RVC-Boss/GPT-SoVITS.☆11Updated last month
- Uses VITS and Opencpop for singing voice synthesis; different from VISinger.☆33Updated last year
- Identify speakers with stable voice timbre.☆26Updated 5 months ago
- Pseudo Streaming SenseVoice with Hotwords☆89Updated 3 weeks ago