arihanv / ShushLinks
Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app
☆211Updated last year
Alternatives and similar repositories for Shush
Users that are interested in Shush are comparing it to the libraries listed below
Sorting:
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆215Updated 4 months ago
- Real-Time Voice Inference Web SDK☆254Updated this week
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more☆292Updated 11 months ago
- Chrome extension to chat with page using local LLM (llama, mistral 7B, etc)☆179Updated last year
- Deepgram Conversational AI demo☆391Updated last month
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆360Updated last year
- A browser extension that lets you chat with YouTube videos using Llama2-7b. Built using 🤗 Inference Endpoints and Vercel's AI SDK.☆161Updated last year
- The JavaScript client for the Cartesia API.☆102Updated 2 weeks ago
- ☆99Updated last year
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆243Updated 8 months ago
- Demo of AI chatbot that predicts user message to generate response quickly.☆104Updated last year
- Create keyboard shortcuts for an LLM using OpenAI GPT, Ollama, HuggingFace with Automator on macOS.☆152Updated last year
- An API to transcribe audio with OpenAI's Whisper Large v3!☆292Updated 8 months ago
- Talk to GPT-4 and create a story together.☆90Updated last year
- ☆94Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- Groq-Powered Real-Time Voice Assistant☆222Updated 8 months ago
- Transcription with speaker diarization pipeline☆94Updated 2 years ago
- ☆89Updated last year
- faster-whisper as serverless endpoint☆108Updated last month
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API☆212Updated 3 weeks ago
- StoryTeller is an experimental web application that creates short audio stories for pre-school kids.☆89Updated last year
- Chat Application Starter Kit — Gemini Multimodal Live API + Pipecat☆202Updated 3 months ago
- Replicate Flux LoRA image editor.☆51Updated 10 months ago
- ☆368Updated last year
- A Function Calls Proxy for Groq, the fastest AI alive!☆201Updated last year
- Play with OpenAI's new Realtime API in your browser☆329Updated 7 months ago
- Record a sample of your own voice and let AI narrate the text in your own voice.☆80Updated last year
- The Moshi speech-to-speech model, deployed to Modal with a realtime CLI chat☆57Updated 9 months ago
- ☆48Updated 9 months ago