EvilFreelancer / docker-fish-speech-serverLinks
OpenAPI-like API-server for voice generation (TTS) based on fish-speech-1.5 model.
☆22Updated 2 months ago
Alternatives and similar repositories for docker-fish-speech-server
Users that are interested in docker-fish-speech-server are comparing it to the libraries listed below
Sorting:
- ASR on WS FAST_API and Sherpa-onnx. Can use Vosk5 and GigaAM☆12Updated this week
- ☆53Updated this week
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆14Updated 4 months ago
- Tools and agents for automated research.☆33Updated this week
- Простой нормализатор текстов перед синтезом речи☆33Updated last year
- Effective LLM Alignment Toolkit☆139Updated last month
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆146Updated 2 weeks ago
- Text To Speech Synthesis with Vosk☆200Updated 3 weeks ago
- Framework for processing and filtering datasets☆27Updated last year
- A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl…☆15Updated 5 months ago
- Have a natural voice conversation with an LLM☆251Updated 7 months ago
- Service for testing out the new Qwen2.5 omni model☆54Updated 3 months ago
- Top ML papers of the week.☆38Updated this week
- A lightweight end-to-end text-to-speech model☆117Updated 5 months ago
- По возможности актуальная информация по ИИ + ресерчи от ChatGPT☆21Updated last month
- Telegram bot for different language models. Supports system prompts and images☆59Updated last month
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆64Updated this week
- Automatic Speech Recognition in Python using ONNX models☆79Updated last month
- FastAPI service on top of WhisperX☆120Updated this week
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆120Updated this week
- Kyutai with an "eye"☆212Updated 4 months ago
- ☆510Updated last month
- ☆83Updated last year
- 100% Local Document deep search with LLMs☆26Updated 11 months ago
- ☆48Updated last month
- an open source ai stylist☆67Updated last month
- ☆23Updated 9 months ago
- Простой IPA фонемизатор на базе ruaccent-encoder☆22Updated 3 months ago
- Thin wrapper around OpenAI Whisper API with streaming support☆89Updated 6 months ago
- A diffusers pipeline for zero shot stylised couples portrait creation☆101Updated 7 months ago