fishaudio / fish-audio-pythonLinks
☆108Updated this week
Alternatives and similar repositories for fish-audio-python
Users that are interested in fish-audio-python are comparing it to the libraries listed below
Sorting:
- ☆382Updated last month
- ☆405Updated 2 weeks ago
- We Speech Transcript based on LLM, in 300 lines of code.☆162Updated this week
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆94Updated 8 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆53Updated this week
- GPT-4o-level, real-time spoken dialogue system.☆328Updated 4 months ago
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆118Updated this week
- ☆160Updated 6 months ago
- Added vLLM support to IndexTTS for faster inference.☆186Updated this week
- A lightweight end-to-end text-to-speech model☆115Updated 3 months ago
- 用于SenseVoice的api项目,输出带时间戳字幕☆34Updated 7 months ago
- ☆198Updated 8 months ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆158Updated 3 months ago
- Running the F5-TTS by ONNX Runtime☆154Updated this week
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆114Updated last month
- ☆200Updated last month
- ☆108Updated this week
- xllamacpp - a Python wrapper of llama.cpp☆40Updated this week
- ubuntu 系统下 GLM-4-Voice 部署经验分享☆19Updated 7 months ago
- A collection of optimized utilities for text-to-audio processing, enhancing both training and inference workflows. This repository contai…☆24Updated 2 months ago
- ☆17Updated 6 months ago
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play☆40Updated 2 months ago
- G2P☆251Updated last month
- OSUM: Open Speech Understanding Model, open-sourced by ASLP@NPU.☆369Updated last week
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"☆62Updated last month
- Preprocess Audio for training☆340Updated 3 months ago
- ChatTTS HTTP API☆53Updated 11 months ago
- A toolkit for speaker diarization.☆195Updated 3 weeks ago
- LlamaVoice is a llama-based large voice generation model, providing inference and training ability.☆233Updated 9 months ago
- API for a Vocal Remover that uses Deep Neural Networks.☆108Updated 11 months ago