fishaudio / fish-audio-pythonLinks
The official Python library for the Fish Audio API.
☆136Updated last week
Alternatives and similar repositories for fish-audio-python
Users that are interested in fish-audio-python are comparing it to the libraries listed below
Sorting:
- ☆473Updated 7 months ago
- ☆483Updated 8 months ago
- A lightweight end-to-end text-to-speech model☆125Updated 10 months ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆174Updated 11 months ago
- Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation☆418Updated last month
- Running the F5-TTS by ONNX Runtime☆188Updated 2 months ago
- Kyutai with an "eye"☆232Updated 9 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆182Updated 6 months ago
- GPT-4o-level, real-time spoken dialogue system.☆363Updated 11 months ago
- ☆338Updated 9 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆86Updated 2 weeks ago
- Open source inference code for Rev's model☆435Updated 8 months ago
- Preprocess Audio for training☆373Updated this week
- Have a natural voice conversation with an LLM☆262Updated 3 months ago
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3☆431Updated last year
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play☆47Updated 9 months ago
- G2P☆383Updated 5 months ago
- ☆167Updated last year
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆180Updated 2 months ago
- Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching☆756Updated last month
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆108Updated 3 months ago
- F5-TTS 推理加速,速度 提升约4倍!☆120Updated last year
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer☆325Updated 3 weeks ago
- A collection of optimized utilities for text-to-audio processing, enhancing both training and inference workflows. This repository contai…☆42Updated 9 months ago
- ☆533Updated 3 months ago
- LongCat Audio Tokenizer and Detokenizer☆268Updated 3 weeks ago
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆306Updated 7 months ago
- A FastAPI service for text-to-speech synthesis using the F5-TTS model. Includes authentication token☆35Updated 8 months ago
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆643Updated 9 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆72Updated 5 months ago