timhagel / MeloTTS-Docker-API-Server
A docker image to access MeloTTS through API calls
☆16Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for MeloTTS-Docker-API-Server
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆64Updated last year
- ☆171Updated 11 months ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆75Updated last year
- ☆296Updated 4 months ago
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆39Updated last month
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆53Updated 10 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆105Updated last year
- Speech Driven Lip sync for Web Browser☆27Updated 5 years ago
- Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd li…☆18Updated 3 months ago
- Pybind11 bindings for Whisper.cpp☆45Updated 3 weeks ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆110Updated 5 months ago
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.☆47Updated 8 months ago
- ☆71Updated this week
- The API server version of the SadTalker project. Runs in Docker, 10 times faster than the original!☆120Updated last year
- Live-Transcription (STT) with Whisper PoC☆156Updated 5 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆103Updated 9 months ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆40Updated last year
- A ggml (C++) re-implementation of tortoise-tts☆159Updated 3 months ago
- AvaChat - is a realtime AI chat demo with animated talking heads - it uses Large Language Models (GPT, API2D GPT4, Cluade) as text inputs…☆77Updated last month
- Talk to GPT-4 and create a story together.☆84Updated 11 months ago
- Lip-sync VRM avatar client for zero-webcam mic-based vtubing☆64Updated 2 years ago
- This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult…☆35Updated 9 months ago
- TypeScript-based library for real-time audio transcription, integrating OpenAI's Whisper model for accurate speech-to-text conversion.☆61Updated 10 months ago
- ☆24Updated last month
- StoryDiffusion serverless worker☆13Updated 6 months ago
- ☆77Updated 4 months ago
- SadTalker gradio_demo.py file with code section that allows you to set the eye blink and pose reference videos for the software to use wh…☆11Updated last year
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆31Updated 2 years ago
- Port of Suno AI's Bark in C/C++ for fast inference☆54Updated 7 months ago
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆35Updated last year