fishaudio / fish-audio-python
☆68Updated last week
Alternatives and similar repositories for fish-audio-python:
Users who are interested in fish-audio-python are comparing it to the libraries listed below.
- SenseVoice-python: An enterprise-grade, open-source, multilingual ASR system from FunASR, running on ONNX Runtime☆79Updated 3 months ago
- A lightweight end-to-end text-to-speech model☆99Updated 3 weeks ago
- We Speech Transcript based on LLM, in 300 lines of code.☆139Updated this week
- A text-conditional diffusion probabilistic model capable of generating high-fidelity audio.☆138Updated 7 months ago
- ☆151Updated last month
- ☆42Updated 10 months ago
- Running the F5-TTS by ONNX Runtime☆80Updated this week
- ✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and langua☆41Updated 2 months ago
- A toolkit for speaker diarization.☆164Updated 2 months ago
- ChatTTS HTTP API☆50Updated 7 months ago
- ☆66Updated last year
- Finding the most similar timbre in a large collection of audio.☆13Updated 7 months ago
- Extension of ChatTTS: 3x faster on Windows, with support for voice cloning and mobile deployment☆98Updated last week
- A simple audio denoising tool that provides a web UI and an API☆19Updated last month
- VC Without Retrain!☆111Updated 8 months ago
- Nendo is an open source platform for AI-driven audio management, intelligence, and generation.☆119Updated 9 months ago
- A VITS Dataset Generation Tool for Converting Video to Short Audio Based on Damo Academy Video Cutting T…☆54Updated last year
- An API project for SenseVoice that outputs subtitles with timestamps☆20Updated 2 months ago
- The YouTube Text-To-Speech dataset comprises waveform audio extracted from YouTube videos alongside their English transcriptions☆52Updated 3 years ago
- ☆86Updated last week
- ChatTTS is a generative speech model for daily dialogue.☆21Updated last week
- AMT-APC: Automatic Piano Cover by Fine-Tuning an Automatic Music Transcription Model☆58Updated 3 weeks ago
- API for a Vocal Remover that uses Deep Neural Networks.☆95Updated 6 months ago
- A GPT-SoVITS inference demo that automatically switches the reference audio based on sentiment analysis of the Chinese input text☆86Updated 10 months ago
- ☆187Updated 3 months ago
- Developer-friendly inference code for RVC-Boss/GPT-SoVITS.☆11Updated 3 months ago
- ☆45Updated 8 months ago
- The API server version of the SadTalker project. Runs in Docker, 10 times faster than the original!☆128Updated last year
- Identify speakers with stable voice timbre.☆27Updated 6 months ago