JigsawStack / insanely-fast-whisper-api
An API to transcribe audio with OpenAI's Whisper Large v3!
☆166Updated 3 weeks ago
Related projects: ⓘ
- Real-Time Voice Inference Web SDK☆126Updated this week
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆151Updated last week
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆109Updated 3 months ago
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app☆171Updated 3 months ago
- ☆419Updated this week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆81Updated 4 months ago
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆60Updated this week
- Generate ideal question-answers for testing RAG☆122Updated 2 months ago
- Live-Transcription (STT) with Whisper PoC☆140Updated 3 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆97Updated 7 months ago
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more☆274Updated last month
- SemanticFinder - frontend-only live semantic search with transformers.js☆211Updated last week
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3☆412Updated 3 weeks ago
- Llama3.1 learns to Listen☆134Updated this week
- ☆100Updated this week
- ☆166Updated 9 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆134Updated 3 weeks ago
- Generate accurate transcripts using Apple's MLX framework☆154Updated last week
- Joint speech-language model - respond directly to audio!☆312Updated 2 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆188Updated last month
- ☆218Updated last month
- Chat with any website on your local machine☆71Updated 2 months ago
- Minimalist web-searching app with an AI assistant that runs directly from your browser. Uses Web-LLM, Wllama and SearXNG. Demo: https://f…☆108Updated this week
- Action library for AI Agent☆187Updated this week
- 🎧 Pod-Helper: Real-time audio transcription and repair on consumer hardware☆78Updated 6 months ago
- The creative suite for character-driven AI experiences.☆176Updated last week
- open-source browser extension that leverages the power of the AI to generate engaging replies for social media growth.☆215Updated 4 months ago
- Local SRT/LLM/TTS Voicechat☆471Updated last month
- Python tools for WhisperKit: Model conversion, optimization and evaluation☆151Updated 5 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆51Updated 8 months ago