matatonic / openedai-whisperLinks

An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.

☆80

Alternatives and similar repositories for openedai-whisper

Users that are interested in openedai-whisper are comparing it to the libraries listed below

Sorting:

ValyrianTech / OpenVoice_server
API server for Instant voice cloning by MyShell.
☆96Updated 9 months ago
menloresearch / ichigo-demo
☆91Updated 2 months ago
KartDriver / mira_converse
☆80Updated 4 months ago
davidbrowne17 / chatterbox-streaming
Streaming and Fine-tuning for Chatterbox TTS
☆128Updated last month
Lex-au / Vocalis
Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…
☆178Updated 3 months ago
taresh18 / conversify
🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨
☆66Updated 3 weeks ago
devnen / Dia-TTS-Server
Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), suppor…
☆277Updated last month
sammyf / ollimca
OLLama IMage CAtegorizer
☆67Updated 6 months ago
Haervwe / open-webui-tools
a Repository of Open-WebUI tools to use with your favourite LLMs
☆240Updated last month
tarun7r / Vocal-Agent
Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.
☆102Updated 2 months ago
ExoFi-Labs / OllamaGTTS
☆186Updated 3 months ago
dynamiccreator / voice-text-reader
Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)
☆52Updated 8 months ago
noco-ai / spellbook-docker
AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models
☆160Updated last year
TesslateAI / TFrameX
☆146Updated last week
akashjss / sesame-csm
A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices.
☆192Updated 2 months ago
BrainDriveAI / openwebui-pipelines
This repository contains custom pipelines developed for the OpenWebUI framework, including advanced workflows such as long-term memory fi…
☆69Updated 2 months ago
KoljaB / LocalEmotionalAIVoiceChat
Simulates talk with an AI that can express emotions
☆75Updated 3 weeks ago
tegridydev / auto-md
Convert Files / Folders / GitHub Repos Into AI / LLM-ready Files
☆159Updated 5 months ago
dkjroot / iris-llm
IRIS: Demonstrator for use of LLMs in python (outdated)
☆62Updated 3 months ago
kaminoer / KokoDOS
Yet another self-hosted AI voice assistant. GlaDOS' blazing fast pipeline with Kokoro TTS voice and vision.
☆57Updated 5 months ago
remichu-ai / gallama
☆131Updated 2 months ago
VideotronicMaker / LM-Studio-Voice-Conversation
Python app for LM Studio-enhanced voice conversations with local LLMs. Uses Whisper for speech-to-text and offers a privacy-focused, acce…
☆101Updated last year
AlgorithmicKing737 / orpheus-tts-local-openai
Run Orpheus 3B Locally With LM Studio
☆31Updated 3 months ago
avarayr / suaveui
Open source LLM UI, compatible with all local LLM providers.
☆175Updated 9 months ago
SomeOddCodeGuy / OfflineWikipediaTextApi
This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …
☆97Updated 3 months ago
masterFoad / NanoSage
Local LLM Powered Recursive Search & Smart Knowledge Explorer
☆244Updated 5 months ago
atgehrhardt / Cerebro-OpenWebUI-Package-Manager
A third-party package manager for OpenWebUI
☆31Updated last year
SingularityMan / vector_companion
A local AI companion that uses a collection of free, open source AI models in order to create two virtual companions that will follow you…
☆221Updated last month
thooton / aspen
Personal voice assistant, with voice interruption and Twilio support
☆18Updated 4 months ago
murtaza-nasir / maestro
MAESTRO is an AI-powered research application designed to streamline complex research tasks.
☆160Updated last month