matatonic / openedai-whisperLinks
An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.
☆77Updated 4 months ago
Alternatives and similar repositories for openedai-whisper
Users that are interested in openedai-whisper are comparing it to the libraries listed below
Sorting:
- API server for Instant voice cloning by MyShell.☆95Updated 9 months ago
- Streaming and Fine-tuning for Chatterbox TTS☆109Updated last week
- Yet another self-hosted AI voice assistant. GlaDOS' blazing fast pipeline with Kokoro TTS voice and vision.☆57Updated 4 months ago
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆52Updated 8 months ago
- Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), suppor…☆255Updated 3 weeks ago
- Run Orpheus 3B Locally With LM Studio☆31Updated 3 months ago
- Summarize URL's or files (including YouTube videos via transcripts) using an OpenAI compatible API.☆38Updated 9 months ago
- Open source tool for transcirption and subtitling, alternative to happyscribe.☆26Updated 4 months ago
- ☆79Updated 3 months ago
- ☆184Updated 2 months ago
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆170Updated 2 months ago
- Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.☆98Updated last month
- a Repository of Open-WebUI tools to use with your favourite LLMs☆232Updated last week
- Simulates talk with an AI that can express emotions☆71Updated last week
- ☆91Updated last month
- A web application that converts speech to speech 100% private☆71Updated 3 weeks ago
- ☆97Updated last year
- A frontend for creative writing with LLMs☆124Updated 11 months ago
- 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨☆55Updated last month
- Web UI and API for managing MCP Orchestrator (mcpo) instances and configurations☆72Updated last month
- AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models☆160Updated last year
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆123Updated 8 months ago
- Orpheus Chat WebUI☆65Updated 2 months ago
- An OpenAI API compatible image generation server for the FLUX.1 family of models from Black Forest Labs☆50Updated 9 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆28Updated 5 months ago
- AI powered Chatbot with real time updates.☆57Updated 8 months ago
- A third-party package manager for OpenWebUI☆31Updated 11 months ago
- EPUB, PDF, DOCX, MD, and TXT file text to speech document reader. Read documents in realtime with high-quality TTS; or extract audiobooks…☆157Updated this week
- OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT☆359Updated 2 months ago
- ☆130Updated last month