asaddi / f5-tts-serve

A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatible API endpoint for speech generation

☆14

Alternatives and similar repositories for f5-tts-serve

Users that are interested in f5-tts-serve are comparing it to the libraries listed below

phildougherty / qwen2.5_omni_chat
Service for testing out the new Qwen2.5 omni model
☆54 Updated 2 months ago
matthewhand / openai-f5-tts
This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …
☆13 Updated 4 months ago
nick-tonjum / open-webui-artifacts-overhaul
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
☆276 Updated 3 months ago
thad0ctor / llama-server-launcher
☆103 Updated this week
Lex-au / Vocalis
Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…
☆195 Updated 3 months ago
devnen / Chatterbox-TTS-Server
Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible),…
☆397 Updated last week
ThetaCursed / clean-ui
Simple UI for Llama-3.2-11B-Vision & Molmo-7B-D
☆136 Updated 9 months ago
Fus3n / gem-assist
Command-line personal assistant using your favorite proprietary or local models with access to over 30+ tools
☆110 Updated 3 weeks ago
ReisCook / Voice_Extractor
Automated speech dataset creator
☆159 Updated last month
PasiKoodaa / ACE-Step-RADIO
ACE-Step: A Step Towards Music Generation Foundation Model
☆41 Updated 2 months ago
amanvirparhar / weebo
A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.
☆233 Updated 6 months ago
parsakhaz / open-ai-stylist
an open source ai stylist
☆65 Updated 3 weeks ago
matatonic / openedai-vision
An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.
☆258 Updated 4 months ago
TesslateAI / TFrameX
☆150 Updated this week
akashjss / orpheus-tts-local-webui
Run Orpheus 3B Locally with Gradio UI, Standalone App
☆23 Updated 3 months ago
cocktailpeanut / hallucinator
☆51 Updated 8 months ago
petermg / Chatterbox-TTS-Extended
Modified version of Chatterbox that accepts text files as input and no character restrictions
☆330 Updated 3 weeks ago
NeuralFalconYT / Kokoro-82M-WebUI
☆39 Updated 5 months ago
kevkid / gguf_gui
☆116 Updated 8 months ago
ETomberg391 / Ecne-AI-Podcaster
AI tool for auto-research, TTS, and Graphical assembly into a completed Podcast
☆75 Updated last week
MehulG / memX
A real-time shared memory layer for multi-agent LLM systems.
☆42 Updated 3 weeks ago
PasiKoodaa / dia
A TTS model capable of generating ultra-realistic dialogue in one pass.
☆31 Updated 2 months ago
astramind-ai / Pulsar
The heart of The Pulsar App, fast, secure and shared inference with modern UI
☆57 Updated 7 months ago
kanttouchthis / text_generation_webui_xtts
XTTSv2 Extension for oobabooga text-generation-webui
☆155 Updated last year
gpustack / vox-box
A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.
☆137 Updated last week
and270 / thinking_effort_processor
☆90 Updated 2 weeks ago
CalvesGEH / VoiceCraftAPI
An API for VoiceCraft.
☆25 Updated last year
Independent-AI-Labs / local-super-agents
Privacy-first agentic framework with powerful reasoning & task automation capabilities. Natively distributed and fully ISO 27XXX complian…
☆66 Updated 3 months ago
Nighthawk42 / mOrpheus
Whisper STT + Orpheus TTS + Gemma 3 using LM Studio to create a virtual assistant.
☆64 Updated 2 months ago
SystemPanic / vllm-windows
A high-throughput and memory-efficient inference and serving engine for LLMs (Windows build & kernels)
☆103 Updated last month