fishaudio / fish-audio-python
☆128Updated 4 months ago
Alternatives and similar repositories for fish-audio-python
Users who are interested in fish-audio-python are comparing it to the libraries listed below.
- ☆462Updated 4 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆177Updated 3 months ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆169Updated 7 months ago
- Preprocess Audio for training☆364Updated 7 months ago
- An open-source chatbot architecture for voice/vision (and multimodal) assistants that can run locally (CPU/GPU-bound) or remotely (I/O-bound).☆78Updated last week
- Running the F5-TTS by ONNX Runtime☆178Updated 3 weeks ago
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play☆40Updated 6 months ago
- ☆121Updated this week
- ☆456Updated 4 months ago
- A toolkit for speaker diarization.☆303Updated last month
- A lightweight end-to-end text-to-speech model☆120Updated 7 months ago
- ☆316Updated 5 months ago
- GPT-4o-level, real-time spoken dialogue system.☆356Updated 8 months ago
- A collection of optimized utilities for text-to-audio processing, enhancing both training and inference workflows. This repository contai…☆39Updated 6 months ago
- SenseVoice-python: An enterprise-grade, open-source, multi-language ASR system from funasr, running on onnxruntime☆101Updated last year
- Open source inference code for Rev's model☆429Updated 5 months ago
- Streaming ASR and TTS based on FastAPI + sherpa-onnx☆153Updated 5 months ago
- Kyutai with an "eye"☆219Updated 6 months ago
- ☆203Updated last year
- ✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and budoux☆64Updated 2 weeks ago
- ☆253Updated last month
- F5-TTS inference acceleration, roughly 4x faster!☆113Updated 9 months ago
- Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching☆657Updated 2 weeks ago
- A FastAPI service for text-to-speech synthesis using the F5-TTS model. Includes authentication token☆35Updated 5 months ago
- ☆184Updated 2 months ago
- Have a natural voice conversation with an LLM☆255Updated this week
- ☆307Updated this week
- G2P☆321Updated last month
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆304Updated 2 months ago
- An API project for SenseVoice that outputs subtitles with timestamps☆40Updated 11 months ago