JigsawStack / insanely-fast-whisper-api
An API to transcribe audio with OpenAI's Whisper Large v3!
☆254Updated 4 months ago
Alternatives and similar repositories for insanely-fast-whisper-api:
Users that are interested in insanely-fast-whisper-api are comparing it to the libraries listed below
- Examples for Cerebrium Serverless GPUs☆471Updated this week
- Live-Transcription (STT) with Whisper PoC☆175Updated 9 months ago
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app☆198Updated 9 months ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆115Updated 10 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆113Updated last year
- 🎧 | RunPod worker of the faster-whisper model for Serverless Endpoint.☆89Updated last month
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆341Updated 9 months ago
- FastAPI service on top of WhisperX☆76Updated this week
- Joint speech-language model - respond directly to audio!☆370Updated 8 months ago
- A Fast TTS Engine☆472Updated 2 months ago
- Open source inference code for Rev's model☆383Updated 2 weeks ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆195Updated last month
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆205Updated 4 months ago
- Open source conversation framework and visual editor for structured Pipecat dialogues☆256Updated this week
- Real-Time Voice Inference Web SDK☆204Updated this week
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection☆104Updated 10 months ago
- ☆201Updated 9 months ago
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3☆483Updated 2 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆92Updated 11 months ago
- ☆91Updated 2 months ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆373Updated 6 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆126Updated 9 months ago
- 🔥 LitLytics - an affordable, simple analytics platform that leverages LLMs to automate data analysis☆97Updated 3 months ago
- Local SRT/LLM/TTS Voicechat☆644Updated 5 months ago
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more☆288Updated 8 months ago
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.☆70Updated last month
- Blazing fast whisper turbo for ASR (speech-to-text) tasks☆199Updated 5 months ago
- Whisper with Medusa heads☆821Updated 3 weeks ago
- ez audio transcription tool with flexible processing and post-processing options☆146Updated last year