pavelzbornik / whisperX-FastAPI
FastAPI service on top of WhisperX
☆41Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for whisperX-FastAPI
- Whisper realtime streaming for long speech-to-text transcription and translation☆103Updated 9 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆84Updated 6 months ago
- Live-Transcription (STT) with Whisper PoC☆155Updated 5 months ago
- Effortlessly record, transcribe, and summarize meetings with this user-friendly desktop utility powered by OpenAI's Whisper and GPT-3.5-t…☆166Updated last year
- 🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch…☆64Updated 2 months ago
- web based editor for subtitles and transcripts☆112Updated 3 months ago
- WhisperX Service love docker!☆11Updated 3 months ago
- Speech Diarization for scrum automation☆97Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆68Updated 6 months ago
- ez audio transcription tool with flexible processing and post-processing options☆130Updated 9 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆69Updated last month
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆32Updated 2 weeks ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆262Updated 2 months ago
- An JS web client for connecting to Pipecat bots with voice and vision☆37Updated 4 months ago
- ☆171Updated 11 months ago
- whisper.cpp bindings for python☆77Updated last year
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆63Updated this week
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆196Updated 3 weeks ago
- ☆296Updated 4 months ago
- ASR using OpenAI capability API `v1/audio/transcriptions` like Groq, SiliconFlow☆22Updated 2 months ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆75Updated last year
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆54Updated 5 months ago
- Open source inference code for Rev's model☆333Updated last week
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps☆84Updated 2 weeks ago
- A lightweight end-to-end text-to-speech model☆91Updated 2 months ago
- ASR + diarization model server with speculative decoding☆50Updated 5 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆53Updated 10 months ago
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆122Updated last year
- Real time faster whisper gradio☆25Updated last month
- 🎧 Pod-Helper: Real-time audio transcription and repair on consumer hardware☆78Updated 8 months ago