timhagel / MeloTTS-Docker-API-ServerLinks

A docker image to access MeloTTS through API calls

☆44

Alternatives and similar repositories for MeloTTS-Docker-API-Server

Users that are interested in MeloTTS-Docker-API-Server are comparing it to the libraries listed below

Sorting:

OpenReplicant / ProtoReplicant
AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)
☆70Updated 2 years ago
ai-bot-pro / achatbot
An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.
☆55Updated this week
luweigen / whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
☆119Updated last year
gaborvecsei / whisper-live-transcription
Live-Transcription (STT) with Whisper PoC
☆186Updated last year
Jaykef / AvaChat
AvaChat - is a realtime AI chat demo with animated talking heads - it uses Large Language Models via api (OpenAI and Claude) as text inpu…
☆104Updated last month
eustlb / speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
☆62Updated 8 months ago
DakeQQ / F5-TTS-ONNX
Running the F5-TTS by ONNX Runtime
☆157Updated 3 weeks ago
Finity-Alpha / OpenVoiceChat
Have a natural voice conversation with an LLM
☆250Updated 6 months ago
morioka / tiny-openai-whisper-api
OpenAI Whisper API-style local server, runnig on FastAPI
☆80Updated 6 months ago
kenwaytis / faster-SadTalker-API
The API server version of the SadTalker project. Runs in Docker, 10 times faster than the original!
☆137Updated last year
KoljaB / LocalEmotionalAIVoiceChat
Simulates talk with an AI that can express emotions
☆73Updated last week
coqui-ai / xtts-streaming-server
☆332Updated last year
rsxdalv / bark-speaker-directory
Site for sharing Bark voices
☆51Updated 3 months ago
matthewhand / openai-f5-tts
This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …
☆12Updated 3 months ago
esnya / realtime-whisper
ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and tranformers
☆31Updated 6 months ago
KoljaB / stream2sentence
Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.
☆65Updated last week
lalanikarim / webrtc-ai-voice-chat
A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.
☆133Updated last year
ValyrianTech / OpenVoice_server
API server for Instant voice cloning by MyShell.
☆95Updated 9 months ago
ylxmf2005 / LLM-Live2D-Desktop-Assitant
Your Live2D desktop assistant powered by LLM! Available for both Windows and MacOS, it senses your screen, retrieves clipboard content, a…
☆82Updated 4 months ago
pipecat-ai / web-client-ui
An JS web client for connecting to Pipecat bots with voice and vision
☆45Updated 6 months ago
nrl-ai / CustomChar
Your customized AI assistant - Personal assistants on any hardware! With llama.cpp, whisper.cpp, ggml, LLaMA-v2.
☆114Updated last year
pavelzbornik / whisperX-FastAPI
FastAPI service on top of WhisperX
☆111Updated this week
doctoroyy / edge-tts-as-a-service
This is a simple HTTP service that uses the Edge-TTS library to generate text-to-speech audio files.
☆30Updated last month
Picovoice / orca
On-device streaming text-to-speech engine powered by deep learning
☆92Updated this week
hanifabd / voice-activity-detection-vad-realtime
Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)
☆82Updated last year
sidharthrajaram / StyleTTS2
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
☆160Updated 11 months ago
rpdrewes / whisper-websocket-server
Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.
☆64Updated last year
ruzhila / voiceapi
Streaming ASR and TTS based on FastAPI+ sherpa-onnx
☆121Updated 2 months ago
mldljyh / whisper_real_time_translation
The subtitles and translations are generated in real-time and displayed as pop-ups.
☆166Updated 2 years ago
SocAIty / Retrieval-based-Voice-Conversion-FastAPI
Adds a web API to RVC to infer via json requests
☆26Updated 11 months ago