matthewhand / openai-f5-tts
This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS engine. The API supports customizable voices, including the default voice Emilia, and allows for easy integration into various applications that require speech synthesis.
☆12 Updated 3 months ago
Alternatives and similar repositories for openai-f5-tts
Users who are interested in openai-f5-tts are comparing it to the libraries listed below.
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends. ☆132 Updated this week
- AI tool for auto-research, TTS, and Graphical assembly into a completed Podcast ☆73 Updated this week
- A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl… ☆13 Updated 5 months ago
- ☆83 Updated last year
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local (CPU/GPU bound) and remote (I/O bound) to run. ☆58 Updated this week
- An API project for SenseVoice that outputs subtitles with timestamps ☆36 Updated 8 months ago
- An agentic workflow for story book generation ☆30 Updated 3 months ago
- Real time faster whisper gradio ☆26 Updated 9 months ago
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play ☆40 Updated 3 months ago
- SenseVoice-python: An enterprise-grade open source multi-language ASR system from FunASR, running with onnxruntime ☆96 Updated 9 months ago
- A lightweight end-to-end text-to-speech model ☆115 Updated 4 months ago
- ☆14 Updated 7 months ago
- ☆18 Updated 7 months ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables" ☆64 Updated 2 months ago
- Cross-platform, open-source Chinese NoteBookLM app based on Electron, using DeepSeek + Reecho.ai ☆74 Updated 8 months ago
- an open source ai stylist ☆64 Updated last week
- OpenAPI-like API-server for voice generation (TTS) based on fish-speech-1.5 model. ☆21 Updated last month
- A FastAPI service for text-to-speech synthesis using the F5-TTS model. Includes authentication token ☆30 Updated 2 months ago
- ☆20 Updated last year
- A diffusers pipeline for zero shot stylised couples portrait creation ☆101 Updated 7 months ago
- ☆41 Updated last year
- a custom comfyui node for fish-speech ☆39 Updated last year
- ☆23 Updated 8 months ago
- This is a simple ComfyUI custom TTS node based on Parler_tts. ☆44 Updated last week
- ☆116 Updated last month
- ☆54 Updated last year
- ComfyUI wrapper for Moondream's gaze detection ☆55 Updated 5 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task… ☆35 Updated 5 months ago
- Have a natural voice conversation with an LLM ☆250 Updated 7 months ago
- Archived 🚧 | 🌻Building ChatBot with LLMs.🌻 | Using async requests. | Adaptable to multiple LLMs | General-purpose large language model agent framework | Multi-persona, fully type-annotated ☆40 Updated last year