matthewhand / openai-f5-ttsLinks
This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS engine. The API supports customizable voices, including the default voice Emilia, and allows for easy integration into various applications that require speech synthesis.
☆12Updated 3 months ago
Alternatives and similar repositories for openai-f5-tts
Users that are interested in openai-f5-tts are comparing it to the libraries listed below
Sorting:
- A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl…☆13Updated 4 months ago
- Real time faster whisper gradio☆26Updated 8 months ago
- AI tool for auto-research, TTS, and Graphical assembly into a completed Podcast☆70Updated last week
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆55Updated this week
- ☆14Updated 7 months ago
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆124Updated 2 weeks ago
- xllamacpp - a Python wrapper of llama.cpp☆44Updated this week
- Eko Browser Extension Template☆30Updated last month
- Demo app for Groq plugins in LiveKit Agents☆52Updated 2 months ago
- GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型☆26Updated last week
- A Chinese Conversation Dataset with Rich Annotations for Super-Aged Seniors☆15Updated last month
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 4 months ago
- Turn Dify API into OpenAI API schema☆15Updated 10 months ago
- Cross Platform Open Sourced Chinese NoteBookLM app based on Electron, Use DeepSeek + Reecho.ai☆74Updated 7 months ago
- OpenAPI-like API-server for voice generation (TTS) based on fish-speech-1.5 model.☆19Updated 3 weeks ago
- This repository provides a Docker image for CosyVoice☆21Updated 6 months ago
- Auto Thinking Mode switch for Qwen3 in Open webui☆65Updated last month
- ☆20Updated last year
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"☆63Updated last month
- 👂 Typing is slow, talk to me. The project name means ' i am tired ' in Chinese (我累了). This is a AI efficiency assistant, complete your d…☆14Updated last year
- Add real-time Speech-to-Text to your LiveKit application with AssemblyAI☆13Updated 2 weeks ago
- 用于SenseVoice的api项目,输出带时间戳字幕☆35Updated 7 months ago
- A lightweight end-to-end text-to-speech model☆114Updated 4 months ago
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI!☆35Updated 2 months ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆56Updated 6 months ago
- g1: Using GPT-4o to create o1-like reasoning chains☆20Updated 9 months ago
- ☆83Updated 11 months ago
- ☆18Updated last year
- ☆22Updated last year
- Incredibly descriptive audiovisual summaries for videos☆41Updated 10 months ago