timhagel / MeloTTS-Docker-API-ServerLinks
A docker image to access MeloTTS through API calls
☆44Updated 11 months ago
Alternatives and similar repositories for MeloTTS-Docker-API-Server
Users that are interested in MeloTTS-Docker-API-Server are comparing it to the libraries listed below
Sorting:
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆70Updated 2 years ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆55Updated this week
- Whisper realtime streaming for long speech-to-text transcription and translation☆119Updated last year
- Live-Transcription (STT) with Whisper PoC☆186Updated last year
- AvaChat - is a realtime AI chat demo with animated talking heads - it uses Large Language Models via api (OpenAI and Claude) as text inpu…☆104Updated last month
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆62Updated 8 months ago
- Running the F5-TTS by ONNX Runtime☆157Updated 3 weeks ago
- Have a natural voice conversation with an LLM☆250Updated 6 months ago
- OpenAI Whisper API-style local server, runnig on FastAPI☆80Updated 6 months ago
- The API server version of the SadTalker project. Runs in Docker, 10 times faster than the original!☆137Updated last year
- Simulates talk with an AI that can express emotions☆73Updated last week
- ☆332Updated last year
- Site for sharing Bark voices☆51Updated 3 months ago
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆12Updated 3 months ago
- ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and tranformers☆31Updated 6 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆65Updated last week
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆133Updated last year
- API server for Instant voice cloning by MyShell.☆95Updated 9 months ago
- Your Live2D desktop assistant powered by LLM! Available for both Windows and MacOS, it senses your screen, retrieves clipboard content, a…☆82Updated 4 months ago
- An JS web client for connecting to Pipecat bots with voice and vision☆45Updated 6 months ago
- Your customized AI assistant - Personal assistants on any hardware! With llama.cpp, whisper.cpp, ggml, LLaMA-v2.☆114Updated last year
- FastAPI service on top of WhisperX☆111Updated this week
- This is a simple HTTP service that uses the Edge-TTS library to generate text-to-speech audio files.☆30Updated last month
- On-device streaming text-to-speech engine powered by deep learning☆92Updated this week
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆82Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆160Updated 11 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆64Updated last year
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆121Updated 2 months ago
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆166Updated 2 years ago
- Adds a web API to RVC to infer via json requests☆26Updated 11 months ago