timhagel / MeloTTS-Docker-API-ServerLinks
A docker image to access MeloTTS through API calls
☆51Updated last year
Alternatives and similar repositories for MeloTTS-Docker-API-Server
Users that are interested in MeloTTS-Docker-API-Server are comparing it to the libraries listed below
Sorting:
- ☆355Updated last year
- FastAPI service on top of WhisperX☆156Updated last week
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆171Updated last month
- The API server version of the SadTalker project. Runs in Docker, 10 times faster than the original!☆143Updated 2 years ago
- Local SRT/LLM/TTS Voicechat☆744Updated last year
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆67Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆121Updated 2 years ago
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++☆71Updated last year
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated 2 years ago
- plug whisper audio transcription to a local ollama server and ouput tts audio responses☆361Updated last month
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆140Updated last year
- Running the F5-TTS by ONNX Runtime☆184Updated last month
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆72Updated 5 months ago
- OpenAI Whisper API-style local server, runnig on FastAPI☆87Updated 2 months ago
- Real-time Speech To Text using Faster Whisper.☆59Updated last year
- a gradio webui for faster whisper☆275Updated 2 years ago
- An JS web client for connecting to Pipecat bots with voice and vision☆44Updated 11 months ago
- Live-Transcription (STT) with Whisper PoC☆201Updated last year
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…☆700Updated 5 months ago
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆178Updated 2 years ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆121Updated last year
- Open source inference code for Rev's model☆433Updated 7 months ago
- ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and tranformers☆36Updated last year
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆103Updated 3 months ago
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.☆246Updated 10 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆73Updated last year
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆180Updated last week
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆72Updated 2 years ago
- 阿里SenseVoice 的fastpi封装,采用onnx发布,体积更小,附带量化模型,支持GPU。支持从URL文件进行语音识别。☆104Updated last year
- ☆477Updated 7 months ago