arihanv / ShushLinks
Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app
☆219Updated last year
Alternatives and similar repositories for Shush
Users that are interested in Shush are comparing it to the libraries listed below
Sorting:
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more☆293Updated last year
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆384Updated last year
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆230Updated 10 months ago
- An API to transcribe audio with OpenAI's Whisper Large v3!☆332Updated last year
- Real-Time Voice Inference Web SDK☆298Updated 3 weeks ago
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆249Updated 4 months ago
- ☆407Updated this week
- The Moshi speech-to-speech model, deployed to Modal with a realtime CLI chat☆59Updated last year
- A browser extension that lets you chat with YouTube videos using Llama2-7b. Built using 🤗 Inference Endpoints and Vercel's AI SDK.☆165Updated 2 years ago
- Chrome extension to chat with page using local LLM (llama, mistral 7B, etc)☆182Updated 2 years ago
- ☆383Updated last year
- ☆100Updated 2 years ago
- TypeScript-based library for real-time audio transcription, integrating OpenAI's Whisper model for accurate speech-to-text conversion.☆72Updated 2 years ago
- Demo of AI chatbot that predicts user message to generate response quickly.☆105Updated last year
- Groq-Powered Real-Time Voice Assistant☆226Updated last year
- Create keyboard shortcuts for an LLM using OpenAI GPT, Ollama, HuggingFace with Automator on macOS.☆153Updated last year
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection☆115Updated last year
- The JavaScript client for the Cartesia API.☆126Updated 2 months ago
- StoryTeller is an experimental web application that creates short audio stories for pre-school kids.☆93Updated last year
- ☆89Updated last year
- open-source browser extension that leverages the power of the AI to generate engaging replies for social media growth.☆241Updated last year
- ☆154Updated 2 years ago
- Chat Application Starter Kit — Gemini Multimodal Live API + Pipecat☆224Updated 2 months ago
- ☆94Updated 2 years ago
- A sample web app using OpenAI Whisper to transcribe audio built on Next.js. It records audio continuously for some time interval then upl…☆179Updated 2 years ago
- Low latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming☆311Updated 6 months ago
- smol-podcaster is your podcast production agent 🎙️☆407Updated 2 months ago
- A spotify playlist agent using CrewAI☆82Updated last year
- ☆52Updated last year
- Prompt to ui for fun☆237Updated last year