EvilFreelancer / docker-fish-speech-serverLinks
OpenAPI-like API-server for voice generation (TTS) based on fish-speech-1.5 model.
☆21Updated last month
Alternatives and similar repositories for docker-fish-speech-server
Users that are interested in docker-fish-speech-server are comparing it to the libraries listed below
Sorting:
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆12Updated 3 months ago
- ASR on WS FAST_API and Sherpa-onnx. Can use Vosk5 and GigaAM☆11Updated this week
- A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl…☆13Updated 5 months ago
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆132Updated this week
- Service for testing out the new Qwen2.5 omni model☆54Updated 2 months ago
- ☆498Updated 3 weeks ago
- Kyutai with an "eye"☆207Updated 3 months ago
- ☆51Updated last week
- Tools and agents for automated research.☆30Updated 2 weeks ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆257Updated 4 months ago
- Whisper STT + Orpheus TTS + Gemma 3 using LM Studio to create a virtual assistant.☆63Updated 2 months ago
- 100% Local Document deep search with LLMs☆26Updated 10 months ago
- Простой нормализатор текстов перед синтезом речи☆33Updated last year
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"☆64Updated 2 months ago
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.☆80Updated 5 months ago
- Real time faster whisper gradio☆26Updated 9 months ago
- Docker compose to run vLLM on Windows☆92Updated last year
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆58Updated this week
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆116Updated this week
- ☆38Updated 5 months ago
- FastAPI service on top of WhisperX☆114Updated this week
- AI tool for auto-research, TTS, and Graphical assembly into a completed Podcast☆73Updated this week
- GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型☆26Updated 3 weeks ago
- ☆23Updated 8 months ago
- ☆90Updated last week
- ☆101Updated this week
- Streaming and Fine-tuning for Chatterbox TTS☆128Updated 3 weeks ago
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆98Updated this week
- Framework for processing and filtering datasets☆27Updated 11 months ago
- Effective LLM Alignment Toolkit☆137Updated 2 weeks ago