arihanv / ShushLinks
Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app
☆216Updated last year
Alternatives and similar repositories for Shush
Users that are interested in Shush are comparing it to the libraries listed below
Sorting:
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆377Updated last year
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more☆293Updated last year
- Deepgram Conversational AI demo☆403Updated 2 months ago
- An API to transcribe audio with OpenAI's Whisper Large v3!☆303Updated 10 months ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆223Updated 7 months ago
- The JavaScript client for the Cartesia API.☆112Updated 3 weeks ago
- Real-Time Voice Inference Web SDK☆289Updated last week
- Chrome extension to chat with page using local LLM (llama, mistral 7B, etc)☆180Updated last year
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆246Updated 3 weeks ago
- Groq-Powered Real-Time Voice Assistant☆223Updated 11 months ago
- ☆99Updated last year
- A browser extension that lets you chat with YouTube videos using Llama2-7b. Built using 🤗 Inference Endpoints and Vercel's AI SDK.☆163Updated 2 years ago
- ☆50Updated last year
- StoryTeller is an experimental web application that creates short audio stories for pre-school kids.☆92Updated last year
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection☆113Updated last year
- A Function Calls Proxy for Groq, the fastest AI alive!☆204Updated last year
- Play with OpenAI's new Realtime API in your browser☆337Updated 2 weeks ago
- ☆94Updated last year
- The Moshi speech-to-speech model, deployed to Modal with a realtime CLI chat☆59Updated last year
- Demo of AI chatbot that predicts user message to generate response quickly.☆104Updated last year
- TypeScript-based library for real-time audio transcription, integrating OpenAI's Whisper model for accurate speech-to-text conversion.☆71Updated last year
- Chat Application Starter Kit — Gemini Multimodal Live API + Pipecat☆221Updated 6 months ago
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API☆216Updated last month
- Create keyboard shortcuts for an LLM using OpenAI GPT, Ollama, HuggingFace with Automator on macOS.☆153Updated last year
- Talk to GPT-4 and create a story together.☆91Updated last year
- open-source browser extension that leverages the power of the AI to generate engaging replies for social media growth.☆239Updated last year
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3☆498Updated last month
- Build robust, production grade function calling assistants that work. Declarative and extensible. Built on top of LangChain ⚡️☆77Updated last year
- ☆89Updated last year
- Transcription with speaker diarization pipeline☆94Updated 2 years ago