arihanv / Shush
Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app
☆194Updated 7 months ago
Alternatives and similar repositories for Shush:
Users that are interested in Shush are comparing it to the libraries listed below
- Real-Time Voice Inference Web SDK☆181Updated 3 weeks ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆331Updated 7 months ago
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more☆284Updated 5 months ago
- Demo of AI chatbot that predicts user message to generate response quickly.☆100Updated 10 months ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆187Updated 4 months ago
- ☆98Updated last year
- Open source conversation framework and visual editor for structured Pipecat dialogues☆94Updated this week
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit.☆116Updated 4 months ago
- The Moshi speech-to-speech model, deployed to Modal with a realtime CLI chat☆56Updated 3 months ago
- Safely deploy OpenAI's Realtime APIs in less than 5 minutes!☆153Updated 3 months ago
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆188Updated 2 months ago
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection☆96Updated 8 months ago
- An API to transcribe audio with OpenAI's Whisper Large v3!☆224Updated 2 months ago
- ☆77Updated 10 months ago
- Blazing fast whisper turbo for ASR (speech-to-text) tasks☆184Updated 2 months ago
- Chat Application Starter Kit — Gemini Multimodal Live API + Pipecat☆153Updated last month
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆90Updated 8 months ago
- The JavaScript client for the Cartesia API.☆62Updated last week
- open-source browser extension that leverages the power of the AI to generate engaging replies for social media growth.☆227Updated 8 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆110Updated 11 months ago
- ☆99Updated last year
- StoryTeller is an experimental web application that creates short audio stories for pre-school kids.☆83Updated 9 months ago
- Transcription with speaker diarization pipeline☆89Updated last year
- Groq-Powered Real-Time Voice Assistant☆206Updated 2 months ago
- Deepgram Conversational AI demo☆360Updated 2 weeks ago
- Create keyboard shortcuts for an LLM using OpenAI GPT, Ollama, HuggingFace with Automator on macOS.☆142Updated 10 months ago
- ☆44Updated 3 months ago
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3☆471Updated 4 months ago