arihanv / ShushLinks
Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app
☆205Updated 11 months ago
Alternatives and similar repositories for Shush
Users that are interested in Shush are comparing it to the libraries listed below
Sorting:
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆352Updated 11 months ago
- Real-Time Voice Inference Web SDK☆238Updated 3 weeks ago
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆244Updated 7 months ago
- Demo of AI chatbot that predicts user message to generate response quickly.☆101Updated last year
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆212Updated 3 months ago
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more☆289Updated 10 months ago
- A browser extension that lets you chat with YouTube videos using Llama2-7b. Built using 🤗 Inference Endpoints and Vercel's AI SDK.☆163Updated last year
- Deepgram Conversational AI demo☆388Updated 3 weeks ago
- The JavaScript client for the Cartesia API.☆97Updated 3 weeks ago
- An API to transcribe audio with OpenAI's Whisper Large v3!☆277Updated 6 months ago
- TypeScript-based library for real-time audio transcription, integrating OpenAI's Whisper model for accurate speech-to-text conversion.☆69Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆213Updated 7 months ago
- Chrome extension to chat with page using local LLM (llama, mistral 7B, etc)☆177Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation☆116Updated last year
- Transcription with speaker diarization pipeline☆93Updated 2 years ago
- ☆99Updated last year
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆132Updated 11 months ago
- Talk to GPT-4 and create a story together.☆90Updated last year
- The Moshi speech-to-speech model, deployed to Modal with a realtime CLI chat☆57Updated 8 months ago
- An JS web client for connecting to Pipecat bots with voice and vision☆44Updated 5 months ago
- ☆89Updated last year
- ☆99Updated 7 months ago
- Python tools for WhisperKit: Model conversion, optimization and evaluation☆215Updated last week
- Groq-Powered Real-Time Voice Assistant☆220Updated 7 months ago
- Safely deploy OpenAI's Realtime APIs in less than 5 minutes!☆155Updated 8 months ago
- A curated list of awesome OpenAI's Whisper☆101Updated last year
- Examples for Cerebrium Serverless GPUs☆487Updated 2 weeks ago
- Build robust, production grade function calling assistants that work. Declarative and extensible. Built on top of LangChain ⚡️☆77Updated last year
- Get started using Deepgram's Live Transcription with this Next.js demo app☆213Updated this week
- A simple voice assistant example built with Next.js and LiveKit React Components☆186Updated this week