KingNish24 / Realtime-whisper-large-v3-turbo
☆53Updated 3 weeks ago
Alternatives and similar repositories for Realtime-whisper-large-v3-turbo
Users who are interested in Realtime-whisper-large-v3-turbo are comparing it to the repositories listed below
- Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.☆98Updated last month
- Simulates talk with an AI that can express emotions☆71Updated last week
- List of curated use cases built using Sesame's CSM 1B☆66Updated 3 weeks ago
- ☆91Updated last month
- Real-time TTS reading of large text files in your favourite voice, plus translation via LLM (Python script)☆52Updated 8 months ago
- Whisper STT + Orpheus TTS + Gemma 3 using LM Studio to create a virtual assistant.☆61Updated last month
- Efficient approach to speaker diarization using voice characteristics extraction☆97Updated last week
- Yet another self-hosted AI voice assistant. GlaDOS' blazing fast pipeline with Kokoro TTS voice and vision.☆57Updated 4 months ago
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆47Updated this week
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆23Updated 2 months ago
- This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work.☆30Updated 8 months ago
- 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨☆55Updated 3 weeks ago
- Open source tool for transcription and subtitling, alternative to happyscribe.☆26Updated 4 months ago
- Orpheus Chat WebUI☆65Updated 2 months ago
- Function Calling Mistral 7B. Learn how to make function calls with open-source LLMs.☆48Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆52Updated 6 months ago
- Whisper from OpenAI and diarization with Pyannote☆44Updated last year
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆132Updated last year
- Speech To Speech: an effort for an open-sourced and modular GPT-4o☆62Updated 8 months ago
- ☆21Updated 2 months ago
- Sesame Converse - Real Time Conversations - Powered by Gemma 3☆62Updated 3 months ago
- ☆205Updated last year
- A simple POC of FastRTC, a framework to use voice mode in python!☆29Updated 2 months ago
- MLX-based QA pair generator and LLM finetuning tool in Streamlit☆34Updated 6 months ago
- Faster Whisper with additional features☆44Updated 3 months ago
- Chat Application Starter Kit — Gemini Multimodal Live API + Pipecat☆199Updated 3 months ago
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆56Updated last month
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆36Updated 11 months ago
- Deploy Apollo HF space locally☆40Updated 6 months ago
- ☆79Updated 3 months ago