arihanv / Shush
Shush is an app that deploys a Whisper v3 model with Flash Attention v2 on Modal and sends requests to it from a Next.js app
☆198 Updated 9 months ago
Alternatives and similar repositories for Shush:
Users who are interested in Shush are comparing it to the repositories listed below
- Real-Time Voice Inference Web SDK ☆204 Updated this week
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more ☆288 Updated 8 months ago
- Demo of AI chatbot that predicts user message to generate response quickly. ☆101 Updated last year
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote ☆195 Updated last month
- An API to transcribe audio with OpenAI's Whisper Large v3! ☆254 Updated 4 months ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️ ☆341 Updated 9 months ago
- The Moshi speech-to-speech model, deployed to Modal with a realtime CLI chat ☆56 Updated 6 months ago
- Safely deploy OpenAI's Realtime APIs in less than 5 minutes! ☆155 Updated 5 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆94 Updated 10 months ago
- ☆99 Updated last year
- Daily Bots Web Demo showcasing how to build real-time voice AI agents ☆221 Updated 4 months ago
- Blazing fast whisper turbo for ASR (speech-to-text) tasks ☆199 Updated 5 months ago
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection ☆104 Updated 10 months ago
- Transcription with speaker diarization pipeline ☆90 Updated last year
- Open-source browser extension that leverages AI to generate engaging replies for social media growth. ☆229 Updated 10 months ago
- Real-Time Transcription Using OpenAI Whisper ☆112 Updated 3 weeks ago
- ☆45 Updated 6 months ago
- TypeScript-based library for real-time audio transcription, integrating OpenAI's Whisper model for accurate speech-to-text conversion. ☆67 Updated last year
- StoryTeller is an experimental web application that creates short audio stories for pre-school kids. ☆86 Updated 11 months ago
- The JavaScript client for the Cartesia API. ☆91 Updated last week
- ☆77 Updated last year
- Create keyboard shortcuts for an LLM using OpenAI GPT, Ollama, HuggingFace with Automator on macOS. ☆147 Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation ☆113 Updated last year
- Live-Transcription (STT) with Whisper PoC ☆175 Updated 9 months ago
- A JS web client for connecting to Pipecat bots with voice and vision ☆43 Updated 3 months ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com. ☆115 Updated 10 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization. ☆205 Updated 4 months ago
- Chat interface that searches the web for you real-time ☆89 Updated 5 months ago
- 🎧 | RunPod worker of the faster-whisper model for Serverless Endpoint. ☆89 Updated last month
- Summarize, Verify & Chat with any YouTube video in seconds. ☆166 Updated last month