matatonic / openedai-whisper
An OpenAI API compatible speech-to-text server for audio transcription and translation, aka Whisper.
☆70 Updated last month
Alternatives and similar repositories for openedai-whisper:
Users who are interested in openedai-whisper are comparing it to the libraries listed below:
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs. ☆94 Updated this week
- A local AI companion that uses a collection of free, open source AI models in order to create two virtual companions that will follow you… ☆136 Updated last month
- ☆91 Updated 2 months ago
- Realtime TTS reading of large text files by your favourite voice, plus translation via LLM (Python script). ☆52 Updated 5 months ago
- A Conversational Speech Generation Model with Gradio UI and support for CUDA, MLX and CPU devices ☆131 Updated this week
- No longer maintained: Your personal ArXiv Curator ☆38 Updated 4 months ago
- A repository of Open-WebUI tools to use with your favourite LLMs ☆175 Updated this week
- ☆67 Updated 3 weeks ago
- ☆125 Updated last week
- OLLama IMage CAtegorizer ☆66 Updated 2 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆53 Updated 3 months ago
- Turns devices into a scalable LLM platform ☆125 Updated this week
- OpenAI compatible TTS for Sesame CSM:1b - Voice Cloning from File/YT ☆199 Updated this week
- An OpenAI API compatible images server to generate or manipulate images. ☆14 Updated last month
- An OpenAI API compatible image generation server for the FLUX.1 family of models from Black Forest Labs ☆41 Updated 6 months ago
- Open source LLM UI, compatible with all local LLM providers. ☆173 Updated 6 months ago
- API server for Instant voice cloning by MyShell. ☆87 Updated 5 months ago
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M. ☆187 Updated 2 months ago
- Open source tool for transcription and subtitling, alternative to happyscribe. ☆25 Updated last month
- A frontend for creative writing with LLMs ☆122 Updated 8 months ago
- ☆46 Updated last month
- AI powered Chatbot with real time updates. ☆49 Updated 5 months ago
- Convert Files / Folders / GitHub Repos Into AI / LLM-ready Files ☆151 Updated last month
- ☆197 Updated last week
- Simulates talk with an AI that can express emotions ☆59 Updated 7 months ago
- List of curated use cases built using Sesame's CSM 1B ☆53 Updated last week
- An OpenAI API compatible API for chat with image input and questions about the images, aka Multimodal. ☆237 Updated 2 weeks ago
- Yet another self-hosted AI voice assistant. GlaDOS' blazing fast pipeline with a more realistic Kokoro TTS voice and vision. ☆53 Updated last month
- A Discord bot for large language models. Add Gemini, Sonnet-3.7, DeepSeek R-1, and other models. Easily change models, edit prompts, and e… ☆75 Updated this week
- Ollama client written in Python ☆158 Updated 3 months ago