matthewhand / openai-f5-tts
This project provides a Flask-based API for generating text-to-speech (TTS) audio with the F5-TTS engine. The API supports customizable voices, including the default voice, Emilia, and is designed for integration into applications that need speech synthesis.
☆12 Updated last month
Alternatives and similar repositories for openai-f5-tts
Users who are interested in openai-f5-tts are comparing it to the repositories listed below.
- ☆13 Updated 5 months ago
- A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl… ☆12 Updated 3 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local (CPU/GPU bound) and remote (I/O bound) to run. ☆47 Updated this week
- This is a simple ComfyUI custom TTS node based on Parler_tts. ☆42 Updated 4 months ago
- g1: Using GPT-4o to create o1-like reasoning chains ☆20 Updated 7 months ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables" ☆33 Updated this week
- ☆19 Updated 6 months ago
- A custom ComfyUI node for fish-speech ☆38 Updated 11 months ago
- ☆20 Updated last year
- Real-time faster-whisper Gradio ☆26 Updated 7 months ago
- An agentic workflow for story book generation ☆29 Updated last month
- Demo app for Groq plugins in LiveKit Agents ☆47 Updated last month
- My AI roles for GPT, Claude, Gemini, GLM, Doubao (豆包), Coze (扣子), and ComfyUI; also includes some prompts for local AI, to let users use their AI agent/workflow i… ☆10 Updated 4 months ago
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends. ☆109 Updated 3 weeks ago
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models ☆15 Updated last year
- WhisperMesh is an advanced chatbot that integrates voice and text interactions, delivering personalized responses through LLM models and … ☆14 Updated 2 weeks ago
- ☆48 Updated 6 months ago
- DeepFloyd IF web UI ☆30 Updated 2 years ago
- ☆16 Updated 10 months ago
- Easily create video datasets with auto-captioning for Hunyuan-Video LoRA finetuning ☆12 Updated last month
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install) ☆23 Updated last month
- A FastAPI service for text-to-speech synthesis using the F5-TTS model. Includes authentication token ☆18 Updated 2 weeks ago
- ☆22 Updated 10 months ago
- The heart of The Pulsar App: fast, secure, and shared inference with a modern UI ☆56 Updated 5 months ago
- A lightweight end-to-end text-to-speech model ☆113 Updated 2 months ago
- ☆22 Updated last year
- ☆12 Updated last year
- A diffusers pipeline for zero shot stylised couples portrait creation ☆101 Updated 5 months ago
- ☆24 Updated 11 months ago
- Animefy: ComfyUI workflow designed to convert images or videos into an anime-like style automatically. ☆21 Updated 10 months ago