KingNish24 / Realtime-whisper-large-v3-turboLinks

☆57

Alternatives and similar repositories for Realtime-whisper-large-v3-turbo

Users that are interested in Realtime-whisper-large-v3-turbo are comparing it to the libraries listed below

Sorting:

cartesia-ai / cartesia-python
The official Cartesia client for Python.
☆97Updated last week
menloresearch / ichigo-demo
☆91Updated 2 months ago
Nighthawk42 / mOrpheus
Whisper STT + Orpheus TTS + Gemma 3 using LM Studio to create a virtual assistant.
☆66Updated 3 months ago
tarun7r / Vocal-Agent
Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.
☆105Updated last week
matatonic / openedai-whisper
An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.
☆82Updated 6 months ago
amanvirparhar / weebo
A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.
☆233Updated 6 months ago
mahimairaja / awesome-csm-1b
List of curated use cases built using Sesame's CSM 1B
☆69Updated 2 months ago
lalanikarim / webrtc-ai-voice-chat
A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.
☆136Updated last year
JigsawStack / insanely-fast-whisper-api
An API to transcribe audio with OpenAI's Whisper Large v3!
☆296Updated 8 months ago
HumeAI / hume-python-sdk
Python client for Hume AI
☆130Updated this week
playht / pyht
PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API
☆214Updated last week
freddyaboulton / orpheus-cpp
Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)
☆302Updated 3 months ago
KoljaB / LocalEmotionalAIVoiceChat
Simulates talk with an AI that can express emotions
☆77Updated last month
dynamiccreator / voice-text-reader
Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)
☆52Updated 9 months ago
callbacked / os1
A lightweight recreation of OS1/Samantha from the movie Her, running locally in the browser
☆106Updated last month
runpod-workers / worker-faster_whisper
faster-whisper as serverless endpoint
☆109Updated 2 months ago
daily-co / nimble-pipecat
Voice Agent Framework for Conversational AI
☆57Updated 3 months ago
adrienbrault / hf-gguf-to-ollama
Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.
☆116Updated last year
kwindla / macos-local-voice-agents
Pipecat voice AI agents running locally on macOS
☆88Updated last week
dimastatz / whisper-flow
Real-Time Transcription Using OpenAI Whisper
☆257Updated 5 months ago
daily-co / pcc-groq-llama
Voice AI agent starter kit with Groq, Llama 4, and (optionally) Twilio
☆73Updated last month
tincans-ai / gazelle
Joint speech-language model - respond directly to audio!
☆370Updated last year
JosefAlbers / whisper-turbo-mlx
Blazing fast whisper turbo for ASR (speech-to-text) tasks
☆213Updated 9 months ago
RandomInternetPreson / Lucid_Autonomy
An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…
☆125Updated 9 months ago
WismutHansen / READ2ME
Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files
☆47Updated last month
nhaouari / local11labs
Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M.
☆49Updated 6 months ago
HumeAI / hume-api-examples
Example projects built with the Hume AI APIs
☆214Updated 2 weeks ago
pipecat-ai / gemini-multimodal-live-demo
Chat Application Starter Kit — Gemini Multimodal Live API + Pipecat
☆208Updated 4 months ago
tegridydev / auto-md
Convert Files / Folders / GitHub Repos Into AI / LLM-ready Files
☆159Updated 6 months ago
severian42 / MoA-Ollama-Chat
This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…
☆117Updated last year