timhagel / MeloTTS-Docker-API-ServerLinks
A docker image to access MeloTTS through API calls
☆43Updated 10 months ago
Alternatives and similar repositories for MeloTTS-Docker-API-Server
Users that are interested in MeloTTS-Docker-API-Server are comparing it to the libraries listed below
Sorting:
- Running the F5-TTS by ONNX Runtime☆154Updated this week
- ☆329Updated 11 months ago
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆12Updated 2 months ago
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆114Updated last month
- FastAPI service on top of WhisperX☆102Updated this week
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆69Updated 2 years ago
- The API server version of the SadTalker project. Runs in Docker, 10 times faster than the original!☆135Updated last year
- Running the F5-TTS by ONNX Runtime standalone with GUI☆19Updated 5 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆53Updated this week
- ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and tranformers☆30Updated 5 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆132Updated 11 months ago
- ☆174Updated last year
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆57Updated last month
- Site for sharing Bark voices☆51Updated 2 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆160Updated 10 months ago
- streaming speech to text server using Whisper☆92Updated 2 years ago
- G2P☆251Updated last month
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆94Updated 8 months ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆81Updated last year
- On-device streaming text-to-speech engine powered by deep learning☆85Updated this week
- Simulates talk with an AI that can express emotions☆69Updated 10 months ago
- lipsync is a simple and updated Python library for lip synchronization, based on Wav2Lip. It synchronizes lips in videos and images based…☆124Updated 4 months ago
- API server for Instant voice cloning by MyShell.☆93Updated 8 months ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated last year
- ☆108Updated this week
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆64Updated last year
- Full version of wav2lip-onnx including face alignment and face enhancement and more...☆117Updated last week
- OpenAI Whisper API-style local server, runnig on FastAPI☆80Updated 6 months ago
- A simple TTS server for generating speech using StyleTTS2☆38Updated last year
- An JS web client for connecting to Pipecat bots with voice and vision☆44Updated 5 months ago