matatonic / openedai-whisper
An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.
☆68Updated last week
Alternatives and similar repositories for openedai-whisper:
Users that are interested in openedai-whisper are comparing it to the libraries listed below
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆53Updated 3 months ago
- ☆91Updated 3 weeks ago
- A local AI companion that uses a collection of free, open source AI models in order to create two virtual companions that will follow you…☆136Updated this week
- a Repository of Open-WebUI tools to use with your favourite LLMs☆124Updated this week
- A frontend for creative writing with LLMs☆117Updated 7 months ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆111Updated 3 months ago
- API server for Instant voice cloning by MyShell.☆83Updated 4 months ago
- Turns devices into a scalable LLM platform☆115Updated this week
- Open source LLM UI, compatible with all local LLM providers.☆170Updated 4 months ago
- An OpenAI API compatible images server to generate or manipulate images.☆14Updated last week
- ☆190Updated 2 weeks ago
- SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks,…☆46Updated 4 months ago
- Convert Files / Folders / GitHub Repos Into AI / LLM-ready Files☆145Updated 2 weeks ago
- ☆28Updated 4 months ago
- ☆123Updated last week
- Yet another self-hosted AI voice assistant. GlaDOS' blazing fast pipeline with a more realistic Kokoro TTS voice and vision.☆45Updated 2 weeks ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆114Updated 8 months ago
- ezlocalai is an easy to set up local artificial intelligence server with OpenAI Style Endpoints.☆83Updated this week
- AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models☆150Updated 9 months ago
- OLLama IMage CAtegorizer☆64Updated last month
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆51Updated 3 months ago
- Open-source Perplexity-like RAG app.☆104Updated 2 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆225Updated 2 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆90Updated 7 months ago
- A simple experiment on letting two local LLM have a conversation about anything!☆104Updated 7 months ago
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro…☆64Updated 3 months ago
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆169Updated 6 months ago
- Eternal is an experimental platform for machine learning models and workflows.☆69Updated 6 months ago